Abstract. Let $\sigma$ and $\omega$ be locally finite positive Borel measures on $\mathbb{R}$. Subject to the pair
of weights satisfying a side condition, we characterize boundedness of the Hilbert transform
$H$ from $L^2(\sigma)$ to $L^2(\omega)$ in terms of the $A_2$ condition

$$\left[\int_{\mathbb{R}}\left(\frac{|I|}{|I|+|x-x_I|}\right)^2 d\omega(x)\int_{\mathbb{R}}\left(\frac{|I|}{|I|+|x-x_I|}\right)^2 d\sigma(x)\right]^{1/2}\le C|I|,$$

and the two testing conditions: for all intervals $I$ in $\mathbb{R}$,

$$\int_I H(1_I\sigma)(x)^2\,d\omega(x)\le C\int_I d\sigma(x),$$
$$\int_I H(1_I\omega)(x)^2\,d\sigma(x)\le C\int_I d\omega(x).$$

The proof uses the beautiful corona argument of Nazarov, Treil and Volberg. There is a
range of side conditions, termed Energy conditions; at one endpoint, the Energy conditions
are also a consequence of the testing conditions above, and at the other endpoint they are
the Pivotal Conditions of Nazarov, Treil and Volberg. We detail an example which shows
that the Pivotal Conditions are not necessary for boundedness of the Hilbert transform.

1. Introduction

We provide sufficient conditions for the two weight inequality for the Hilbert transform.
Indeed, subject to a side condition, a characterization of the two weight $L^2$ inequality is
given.

For a signed measure $\omega$ on $\mathbb{R}$ define

(1.1) $$H\omega(x)\equiv \mathrm{p.v.}\int\frac{1}{x-y}\,\omega(dy).$$

A weight $\omega$ is a non-negative locally finite measure. For two weights $\omega,\sigma$, we are interested
in the inequality

(1.2) $$\|H(\sigma f)\|_{L^2(\omega)}\lesssim\|f\|_{L^2(\sigma)}.$$

See Definition 1.19 below for a precise definition of p.v. and the meaning of (1.2). The two
weight problem for the Hilbert transform is to provide a real variable characterization of the
pair of weights $\omega,\sigma$ for which inequality (1.2) holds.

What should such a characterization look like? Motivated by the very successful $A_p$ theory
for the Hilbert transform in Hunt, Muckenhoupt and Wheeden [HuMuWh], one suspects that
the weights should satisfy the two weight analog of the $A_2$ condition:

$$\sup_I\ \frac{1}{|I|}\int_I\omega(dx)\cdot\frac{1}{|I|}\int_I\sigma(dx)<\infty.$$

As it turns out this two weight $A_2$ condition is not sufficient. The suggestion for additional
necessary conditions comes from the T1 theorem of David and Journé [MR763911] and the
two weight theorems of the second author for fractional integral operators, [Saw3]. These
conditions require the following, holding uniformly over intervals $I$:

(1.3) $$\int_I|H(1_I\sigma)|^2\,\omega(dx)\le\mathcal{H}^2\sigma(I),$$

(1.4) $$\int_I|H(1_I\omega)|^2\,\sigma(dx)\le(\mathcal{H}^*)^2\omega(I).$$

Here, we are letting $\mathcal{H}$ and $\mathcal{H}^*$ denote the smallest constants for which these inequalities are
true uniformly over all intervals $I$, and we write $\sigma(I)=\int_I\sigma(dx)$.

Clearly, (1.3) is derived from applying the inequality (1.2) to indicators of intervals. One
advantage of formulating the inequality (1.2) with the measure $\sigma$ on both sides of the
inequality is that duality is then easy to derive: interchange the roles of $\omega$ and $\sigma$. Thus, the
condition (1.4) is also derived from (1.2). We call these 'testing conditions' as they are
derived from simple instances of the claimed inequality. Also, we emphasize that duality in this
sense is basic to the subject, and we will appeal to it repeatedly.

In a beautiful series of papers, Nazarov, Treil and Volberg have developed a sophisticated
approach toward proving the sufficiency of these testing conditions combined with an
improvement of the two weight $A_2$ condition. To describe this improvement, we define this
variant of the Poisson integral for use throughout this paper. For an interval $I$ and measure $\omega$,

(1.5) $$P(I,\omega)\equiv\int_{\mathbb{R}}\frac{|I|}{(|I|+\operatorname{dist}(x,I))^2}\,\omega(dx),$$

(1.6) $$\sup_I P(I,\omega)\cdot P(I,\sigma)=A_2^2<\infty.$$

The last line is the improved condition of Nazarov, Treil and Volberg. We will refer to (1.6)
as simply the $A_2$ condition. F. Nazarov has shown that even this strengthened $A_2$ condition
is not sufficient for the two weight inequality (1.2); see e.g. Theorem 2.1 in [NiTr].
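For measures made of finitely many point masses, the Poisson integral (1.5) is a finite sum and can be evaluated directly. The following Python sketch is only an illustration: the function name `poisson` and the encoding of a measure as a list of `(point, mass)` pairs are assumptions made here, not notation from the paper.

```python
def poisson(interval, atoms):
    """Evaluate P(I, w) from (1.5) for w = sum_j m_j * delta_{x_j}.

    interval: pair (a, b) with a < b (hypothetical encoding of I).
    atoms: list of (x_j, m_j) pairs (hypothetical encoding of the measure).
    """
    a, b = interval
    length = b - a
    total = 0.0
    for x, mass in atoms:
        # Distance from the point x to the interval [a, b].
        dist = max(a - x, 0.0, x - b)
        total += mass * length / (length + dist) ** 2
    return total


def a2_product(interval, omega_atoms, sigma_atoms):
    """The product P(I, w) * P(I, s) whose supremum over I is A_2^2 in (1.6)."""
    return poisson(interval, omega_atoms) * poisson(interval, sigma_atoms)
```

A unit mass inside $I=(0,1)$ contributes $1/|I|$, while a unit mass at distance 2 from $I$ contributes $|I|/(|I|+2)^2$, which shows how the tails of both measures enter the $A_2$ condition.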

The approach of Nazarov, Treil and Volberg involves a delicate combination of ideas:
random grids (see [NTV2]), weighted Haar functions and Carleson embeddings (see [NTV3]),
stopping intervals (see [Vol]) and culminates in the use of these techniques with a corona
decomposition in the brilliant 2004 preprint [NTV4]. Theorem 2.2 of that paper proves
the sufficiency of conditions (1.6), (1.3) and (1.4) for the two weight inequality (1.2) in the
presence of two additional side conditions, the Pivotal Conditions given by

(1.7) $$\sum_{r=1}^{\infty}\omega(I_r)\,P(I_r,1_{I_0}\sigma)^2\le\mathcal{P}^2\sigma(I_0),$$

(and its dual) where the inequality is required to hold for all intervals $I_0$, and decompositions
$\{I_r : r\ge1\}$ of $I_0$ into disjoint intervals $I_r\subset I_0$. As a result they obtain the equivalence
of (1.2) with the three conditions (1.6), (1.3) and (1.4) when both weights are doubling, and
also when two maximal inequalities hold. Our Theorem below contains this result as a special
case.

In our approach, we replace the Pivotal Condition (1.7) by certain weaker side conditions
of energy type. We begin with a necessary form of energy.

Definition 1.8. For a weight $\omega$, and interval $I$, we set

$$E(I,\omega)\equiv\left[E_I^{\omega(dx)}\left[E_I^{\omega(dx')}\,\frac{x-x'}{|I|}\right]^2\right]^{1/2}.$$

It is important to note that $E(I,\omega)\le1$, and can be quite small, if $\omega$ is highly concentrated
inside the interval $I$; in particular if $\omega 1_I$ is a point mass, then $E(I,\omega)=0$. Note also that
$|I|^2E(I,\omega)^2$ is the variance of the variable $x$, and that we have the identity

$$E(I,\omega)^2=\frac12\,E_I^{\omega(dx)}E_I^{\omega(dx')}\,\frac{(x-x')^2}{|I|^2}.$$

The following Energy Condition is necessary for the two weight inequality:

(1.9) $$\sum_{r\ge1}\omega(I_r)\,E(I_r,\omega)^2\,P(I_r,1_{I_0}\sigma)^2\le\mathcal{E}^2\sigma(I_0),$$

where the sum is taken over all decompositions $I_0=\bigcup_{r=1}^{\infty}I_r$ of the interval $I_0$ into pairwise
disjoint intervals $\{I_r\}_{r\ge1}$. As $E(I,\omega)\le1$, the Energy Condition is weaker than the Pivotal
Condition.
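The two expressions for the energy in Definition 1.8 can be compared numerically when $\omega$ is a finite sum of point masses. The sketch below is illustrative only: the names `energy` and `energy_pairs`, the `(point, mass)` encoding of the measure, and the choice to restrict to the closed interval are assumptions made here.

```python
import math


def _restrict(interval, atoms):
    """Atoms of the measure lying in the closed interval [a, b] (an assumption)."""
    a, b = interval
    return [(x, m) for x, m in atoms if a <= x <= b]


def energy(interval, atoms):
    """E(I, w) via the variance: |I|^2 E(I, w)^2 is the variance of x on I."""
    a, b = interval
    inside = _restrict(interval, atoms)
    total = sum(m for _, m in inside)
    if total == 0:
        return 0.0
    mean = sum(m * x for x, m in inside) / total
    variance = sum(m * (x - mean) ** 2 for x, m in inside) / total
    return math.sqrt(variance) / (b - a)


def energy_pairs(interval, atoms):
    """E(I, w) via the identity E^2 = (1/2) E E (x - x')^2 / |I|^2."""
    a, b = interval
    inside = _restrict(interval, atoms)
    total = sum(m for _, m in inside)
    if total == 0:
        return 0.0
    double = sum(m * mp * (x - xp) ** 2 for x, m in inside for xp, mp in inside)
    return math.sqrt(0.5 * double / total ** 2) / (b - a)
```

Two equal masses at the endpoints of $I$ give energy $1/2$, and a single point mass gives energy $0$, in line with the remarks following Definition 1.8.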

As a preliminary sufficient side condition, we consider the geometric mean of the pivotal
and energy conditions: for $0\le\epsilon\le2$ we say that the weight pair $(\omega,\sigma)$ satisfies the Hybrid
Energy Condition, or simply Hybrid Condition, provided

(1.10) $$\sum_{r\ge1}\omega(I_r)\,E(I_r,\omega)^{\epsilon}\,P(I_r,1_{I_0}\sigma)^2\le\mathcal{E}_\epsilon^2\,\sigma(I_0),$$

where the sum is taken over all decompositions $I_0=\bigcup_{r=1}^{\infty}I_r$. When $\epsilon=2$ this is the necessary
Energy Condition and when $\epsilon=0$ this is the Pivotal Condition. A corollary of our main
theorem is that if the weight pair $(\omega,\sigma)$ satisfies the Hybrid Condition (1.10) and its dual
for some $\epsilon<2$, then the two weight inequality (1.2) is equivalent to the $A_2$ condition (1.6)
and the testing conditions (1.3) and (1.4).

Later in this paper we exhibit a weight pair $(\omega,\sigma)$ satisfying (1.3), (1.4), (1.6) and the
Hybrid Conditions for some $\epsilon<2$, but for which the dual Pivotal Condition fails. In particular
this shows that the Pivotal Conditions are not necessary for the two weight inequality (1.2).

1.1. An optimal condition. Now we describe a sufficient side condition that is optimal
for the method of proof. First it is convenient to introduce two functionals of pairs of sets
that arise.

Definition 1.11. Fix $0\le\epsilon\le2$. We define the functionals

(1.12) $$\Phi(J,E)\equiv\omega(J)\,E(J,\omega)^2\,P(J,1_E\sigma)^2,$$
$$\Psi(J,E)\equiv\omega(J)\,E(J,\omega)^{\epsilon}\,P(J,1_E\sigma)^2.$$

Note that $\Phi(I_r,I_0)$ appears in the sum on the left side of the Energy Condition (1.9), and
$\Phi(J,I\setminus I^*)$ appears again on the right side of the dual Energy Estimate (6.7) below. The
larger functional $\Psi(I_r,I_0)$ appears in the sum on the left side of the Hybrid Condition (1.10).

It turns out that one can replace $\Psi$ in the proof below with any functional, subject to three
properties holding. We now describe the three properties required of the functional $\Psi$.

Set $e(I)\equiv\{a,b,\frac{a+b}{2}\}$ to be the set consisting of the endpoints and midpoint of an interval
$I=[a,b]$. We say that a subpartition $\{J_r\}$ of $I$ is $\varepsilon$-good if

(1.13) $$\operatorname{dist}(J_r,e(I))\ge\frac12|J_r|^{\varepsilon}|I|^{1-\varepsilon}.$$

For $\gamma>0$ and $\varepsilon>0$, and for all pairs of intervals $I_0\subset I$ in $\mathcal{D}^\sigma$ we require

(1.14)
$$\Psi(I_0,I_0)\le\mathfrak{F}_{\gamma,\varepsilon}^2\,\sigma(I_0),$$
$$\sum_{r\ge1}\Psi(I_r,I_0)\le\mathfrak{F}_{\gamma,\varepsilon}^2\,\sigma(I_0),\quad\text{for all subpartitions }\{I_r\}\text{ of }I_0,$$
$$\sum_{r\ge1}\Phi(J_r,I\setminus I_0)\le\sup_{r\ge1}\left(\frac{|J_r|}{|I_0|}\right)^{\gamma}\Psi(I_0,I\setminus I_0),\quad\text{for all }\varepsilon\text{-good subpartitions }\{J_r\}\text{ of }I_0.$$

Note: It is important to note that the second line requires us to test over all subpartitions
$\{I_r\}$ of $I_0$. In the third line we need only test over the $\varepsilon$-good subpartitions,
but must include differences $I\setminus I_0$ of intervals in the argument of $\Phi$ on the left side.

When $\Psi$ is given by (1.12), the first line in (1.14) is the usual $A_2$ condition, the second line
is the Hybrid Condition (1.10) with e = 7, and the third line is proved in Lemma 2.19 below. 

For fixed $\gamma,\varepsilon>0$, there is a smallest functional $\Psi_{\gamma,\varepsilon}$ satisfying the third line in (1.14),
namely

(1.15) $$\Psi_{\gamma,\varepsilon}(I,E)\equiv\sup_{\bigcup_{s\ge1}J_s\subset I}\ \inf_{s\ge1}\left(\frac{|I|}{|J_s|}\right)^{\gamma}\sum_{s\ge1}\Phi(J_s,E),$$

where the supremum is taken over all $\varepsilon$-good subpartitions $\{J_s\}$ of the interval $I$. Note that
$\Psi_{\gamma,\varepsilon}(I,E)$ becomes smaller as either $\gamma$ or $\varepsilon$ becomes smaller, and also as $E$ becomes smaller.
The functional $\Psi_{\gamma,\varepsilon}$ also satisfies the first line as we see by taking $E=I_0$ and the trivial
decomposition $I_1=I_0$. Then the second line in (1.14) becomes

(1.16) $$\sum_{r\ge1}\Psi_{\gamma,\varepsilon}(I_r,I_0)\le\mathfrak{F}_{\gamma,\varepsilon}^2\,\sigma(I_0),\quad\text{for all subpartitions }\{I_r\}\text{ of }I_0.$$

This condition (1.16), which we call the Energy Hypothesis, thus represents the optimal side
condition that can be used, along with its dual version, with the methods of this paper (all
three lines in (1.14) hold and the third line is optimal). From Lemma 2.19 and the optimal
property of $\Psi_{\gamma,\varepsilon}$, we see that the Hybrid Condition (1.10) implies the Energy Hypothesis
(1.16) with $\gamma=2-\epsilon-2\varepsilon>0$.

Theorem 1.17. Suppose that $\omega$ and $\sigma$ are locally finite positive Borel measures on the real
line having no point masses in common, namely $\omega(\{x\})\,\sigma(\{x\})=0$ for all $x\in\mathbb{R}$. Suppose
in addition that for some $\gamma>0$ and $0<\varepsilon<1$ we have both Energy Hypothesis constants
$\mathfrak{F}_{\gamma,\varepsilon}$ and $\mathfrak{F}^*_{\gamma,\varepsilon}$ finite. Then the two weight inequality (1.2) holds if and only if

• the pair of weights satisfies the $A_2$ condition (1.6);

• the testing conditions (1.3) and (1.4) both hold.

Remark 1.18. The reader can easily check that Theorem 1.17 holds if the infimum $\inf_{s\ge1}\left(\frac{|I|}{|J_s|}\right)^{\gamma}$
in (1.15) is replaced by $\inf_{s\ge1}\eta\left(\frac{|I|}{|J_s|}\right)$ for a suitable Dini function $\eta$ on $[1,\infty)$.

The quantitative estimate we give for the norm of the Hilbert transform is given in (5.1).
Consider now the conjecture of Volberg [Vol] that the two weight inequality holds if and only if
the $A_2$ and testing conditions hold. Since the Energy Condition (1.9) is actually a consequence
of the $A_2$ and testing condition (1.3), Volberg's conjecture would be proved if we could take
$\gamma=0$ in Theorem 1.17. (That we can take $\varepsilon>0$ follows from the general techniques of §4.)
There are subtle obstacles to overcome in order to achieve such a characterization.



We will follow the beautiful approach of Nazarov, Treil and Volberg using random grids, 
stopping intervals and corona decompositions. Energy enters into the argument at those 
parts based upon the smoothness of the kernel, see the Energy Lemma, especially (6.6) 
below. Much of the argument we use appears in Chapters 17-22 of the CBMS book by 
Volberg [Vol], with the final touches in the preprint of Nazarov, Treil and Volberg [NTV4]. 
In order to make this complicated proof self-contained, we reproduce these critical ideas in 
our sufficiency proof below. 

In §10, we exhibit a pair of weights which satisfy the two-weight inequality, as they fall
within the scope of our Main Theorem, yet they do not satisfy the Pivotal Condition of
Nazarov-Treil-Volberg.

The main novelty of this paper is that (1) the energy condition is necessary for the two
weight testing conditions, (2) the Energy Hypothesis can be inserted into the approach of
[NTV4], and (3) that the Pivotal Conditions are not necessary for the two-weight inequality.

The integral defining $H(\sigma f)$ in (1.2) is not in general absolutely convergent, and we must
introduce appropriate truncations. The following canonical construction from [Vol] serves 
our purposes here. 

Definition 1.19. Let $\zeta$ be a fixed smooth nondecreasing function on the real line satisfying

$$\zeta(t)=0\ \text{for } t\le\tfrac12\quad\text{and}\quad\zeta(t)=1\ \text{for } t\ge1.$$

Given $\varepsilon>0$, set $\zeta_\varepsilon(t)=\zeta\left(\frac{t}{\varepsilon}\right)$ and define the smoothly truncated operator $T_\varepsilon$ by the absolutely
convergent integral

$$T_\varepsilon f(x)=\int\frac{1}{y-x}\,\zeta_\varepsilon(|x-y|)\,f(y)\,d\sigma(y),\qquad f\in L^2(\sigma)\text{ with compact support}.$$

We say (1.2) holds if the inequality there holds for all compactly supported $f$ with $T_\varepsilon$ in place
of $T$, uniformly in $\varepsilon>0$.

One easily verifies that all of the necessary conditions derived below can be achieved using
this definition provided $\omega$ and $\sigma$ have no point masses in common (note that if $\omega=\sigma=\delta_x$,
then (1.2) holds trivially with this definition while (1.6) fails). Moreover, the kernels
$\frac{1}{y-x}\zeta_\varepsilon(|x-y|)$ of $T_\varepsilon$ are uniformly standard Calderón-Zygmund kernels, and thus all of the
sufficiency arguments below hold as well using this definition. In the sequel we will suppress
the use of $T_\varepsilon$ and simply write $T$.

2. Necessary Conditions 

In this section, we collect some conditions which follow either from the assumed norm
inequality or the testing conditions. These are the $A_2$ condition, a weak-boundedness condition,
and the Energy Condition. The principal novelty is the Energy Condition.

2.1. The Necessity of the $A_2$ Condition. In this section we will give a new proof of this
known fact due to F. Nazarov:

Proposition 2.1. Assuming the norm inequality (1.2), we have the $A_2$ condition (1.6).
Qualitatively,

$$\mathcal{N}\equiv\|H(\cdot\,\sigma)\|_{L^2(\sigma)\to L^2(\omega)}\gtrsim A_2.$$

The analogue of this inequality in the unit disk was proved for the conjugate operator in
[NTV4] and [Vol] Chapter 16. We provide a real-variable proof here.

Proof. Fix an interval $I$ and for $a\in\mathbb{R}$ and $r>0$ let

$$s_I(x)=\frac{|I|}{|I|+|x-x_I|},\qquad f_{a,r}(y)=1_{(a-r,a)}(y)\,s_I(y),$$

where $x_I$ is the center of the interval $I$. For $y<x$ we have

$$|I|(x-y)=|I|(x-x_I)+|I|(x_I-y)\le(|I|+|x-x_I|)(|I|+|x_I-y|),$$

and so

$$\frac{1}{x-y}\ge|I|^{-1}s_I(x)\,s_I(y),\qquad y<x.$$

Thus for $x>a$ we obtain that

$$H(f_{a,r}\sigma)(x)=\int_{a-r}^{a}\frac{1}{x-y}\,s_I(y)\,d\sigma(y)\ge|I|^{-1}s_I(x)\int_{a-r}^{a}s_I(y)^2\,d\sigma(y).$$

Applying our assumed two weight inequality (1.2) in the sense of Definition 1.19, and then
letting $\varepsilon>0$ there go to 0, we see that

$$|I|^{-2}\int_a^{\infty}s_I(x)^2\left(\int_{a-r}^{a}s_I(y)^2\,d\sigma(y)\right)^2d\omega(x)\le\|H(\sigma f_{a,r})\|_{L^2(\omega)}^2\le\mathcal{N}^2\|f_{a,r}\|_{L^2(\sigma)}^2=\mathcal{N}^2\int_{a-r}^{a}s_I(y)^2\,d\sigma(y).$$

Rearranging the last inequality, we obtain

$$|I|^{-2}\int_a^{\infty}s_I(x)^2\,d\omega(x)\int_{a-r}^{a}s_I(y)^2\,d\sigma(y)\le\mathcal{N}^2,$$

and upon letting $r\to\infty$, and taking a square root,

(2.2) $$\left(\int_a^{\infty}s_I(x)^2\,d\omega(x)\int_{-\infty}^{a}s_I(y)^2\,d\sigma(y)\right)^{1/2}\le\mathcal{N}\,|I|.$$

The ranges of integration are complementary half-lines, and clearly we can reverse the role
of the two weights above.
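The pointwise kernel bound $\frac{1}{x-y}\ge|I|^{-1}s_I(x)s_I(y)$ for $y<x$, used at the start of this proof, can be spot-checked on a grid of sample points. This Python sketch is a sanity check under stated assumptions, not part of the argument: the names `s` and `kernel_bound_holds` and the choice of sample points are hypothetical.

```python
def s(interval, x):
    """s_I(x) = |I| / (|I| + |x - x_I|), with x_I the center of I."""
    a, b = interval
    length = b - a
    center = 0.5 * (a + b)
    return length / (length + abs(x - center))


def kernel_bound_holds(interval, points):
    """Check 1/(x - y) >= s_I(x) s_I(y) / |I| for all sampled pairs with y < x."""
    a, b = interval
    length = b - a
    for y in points:
        for x in points:
            if y < x and 1.0 / (x - y) < s(interval, x) * s(interval, y) / length:
                return False
    return True
```

The check covers points on both sides of the center $x_I$ as well as points far from $I$, which are the configurations that feed the half-line integrals in (2.2).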

Choose $a\in\mathbb{R}$ which evenly divides the $L^2(\sigma)$-norm of $s_I$ in this sense:

(2.3) $$\int_{-\infty}^{a}s_I(y)^2\,d\sigma(y)=\int_a^{\infty}s_I(y)^2\,d\sigma(y)=\frac12\int_{-\infty}^{\infty}s_I(y)^2\,d\sigma(y),$$

and conclude that

(2.4)
$$\int_{-\infty}^{\infty}s_I(x)^2\,d\omega(x)\int_{-\infty}^{\infty}s_I(y)^2\,d\sigma(y)=\int_{-\infty}^{a}s_I(x)^2\,d\omega(x)\int_{-\infty}^{\infty}s_I(y)^2\,d\sigma(y)+\int_a^{\infty}s_I(x)^2\,d\omega(x)\int_{-\infty}^{\infty}s_I(y)^2\,d\sigma(y)$$
$$=2\int_{-\infty}^{a}s_I(x)^2\,d\omega(x)\int_a^{\infty}s_I(y)^2\,d\sigma(y)+2\int_a^{\infty}s_I(x)^2\,d\omega(x)\int_{-\infty}^{a}s_I(y)^2\,d\sigma(y)$$
$$\lesssim\mathcal{N}^2|I|^2.$$

Dividing through by $|I|^2$, and forming the supremum over $I$ concludes the proof in the case
where we can choose $a$ as in (2.3).

We now consider the case where a point mass in $\sigma$ prevents (2.3) from holding. If we
replace $a$ by $a+\varepsilon$ in (2.2), and then let $\varepsilon\to0$ this gives

$$\int_{(a,\infty)}s_I(x)^2\,d\omega(x)\int_{(-\infty,a]}s_I(y)^2\,d\sigma(y)\le\mathcal{N}^2|I|^2.$$

The ranges of integration are complementary half-lines, and clearly we can reverse the role
of the open and closed half-lines, as well as the role of the two weights, resulting in four such
inequalities altogether.

Now choose $a\in\mathbb{R}$ to be the largest number satisfying

$$\int_{(-\infty,a)}s_I(y)^2\,d\sigma(y)\le\frac12\int_{-\infty}^{\infty}s_I(y)^2\,d\sigma(y).$$

Of course it may happen that strict inequality occurs in (2.1) due to a point mass in $\sigma$ at
the point $a$. In the event that this point mass is missing or relatively small, i.e.

$$\sigma(\{a\})\,s_I(a)^2<\tfrac12A,$$

where $A=\int_{-\infty}^{\infty}s_I(y)^2\,d\sigma(y)$, we can conclude that at least one of the integrals $\int_{(-\infty,a)}s_I(y)^2\,d\sigma(y)$
or $\int_{(a,\infty)}s_I(y)^2\,d\sigma(y)$ is at least $\frac14A$. Suppose that the first integral $\int_{(-\infty,a)}s_I(y)^2\,d\sigma(y)$ is at
least $\frac14A$, and moreover is the smaller of the two if both are at least $\frac14A$. Then we also have
$\int_{[a,\infty)}s_I(y)^2\,d\sigma(y)\ge\frac14A$, where we have included the point mass at $a$ in the integral on the
left. We can now repeat the argument of (2.4) to conclude this case.

It remains to consider the case that the point mass at $a$ is a relatively large proportion of
the Poisson integral, i.e.

$$\sigma(\{a\})\,s_I(a)^2>\frac12\int_{-\infty}^{\infty}s_I(y)^2\,d\sigma(y).$$

But then, consider the two universal inequalities

$$\int_{(a,\infty)}s_I(x)^2\,d\omega(x)\int_{(-\infty,a]}s_I(y)^2\,d\sigma(y)\le\mathcal{N}^2|I|^2,$$
$$\int_{(-\infty,a)}s_I(x)^2\,d\omega(x)\int_{[a,\infty)}s_I(y)^2\,d\sigma(y)\le\mathcal{N}^2|I|^2.$$

Both integrals against $\sigma$ include the point mass at $a$, hence they exceed $\frac12\int_{-\infty}^{\infty}s_I(y)^2\,d\sigma(y)$.
It is our hypothesis that $\omega$ and $\sigma$ do not have common point masses, so we conclude the $A_2$
condition in this case.

□ 

Remark 2.5. Preliminary results in this direction were obtained by Muckenhoupt and
Wheeden, and in the setting of fractional integrals by Gabidzashvili and Kokilashvili, and here we
follow the argument proving (1.9) in Sawyer and Wheeden [SaWh], where 'two-tailed'
inequalities, like those in the $A_2$ condition (1.6), originated in the fractional integral setting.
A somewhat different approach to this for the conjugate operator in the disk uses conformal
invariance and appears in [NTV4], and provides the first instance of a strengthened $A_2$
condition being proved necessary for a two weight inequality for a singular integral.

Remark 2.6. In the proof of the sufficient direction of the Main Theorem 1.17, we only need
'half' of the $A_2$ condition. Namely, we only need

$$\sup_I\ \frac{\omega(I)}{|I|}\cdot P(I,\sigma)<\infty$$

along with the dual condition. This point could be of use in seeking to verify that a particular
pair of weights satisfies the testing conditions.

2.2. The Weak Boundedness Condition. We show that a condition analogous to the
weak-boundedness criterion of the David and Journé T1 theorem is a consequence of the $A_2$
condition and the two testing conditions (1.3) and (1.4).

For a constant $C>1$, let $\mathcal{W}_C$ be the best constant in the inequality

(2.7) $$\left|\int_JH(1_I\sigma)\,\omega(dx)\right|\le\mathcal{W}_C\sqrt{\omega(J)\,\sigma(I)},$$

where the inequality is uniform over all intervals $I,J$ with $\operatorname{dist}(I,J)\le|I|+|J|$ and $C^{-1}\le|I|/|J|\le C$.
The exact value of $C$ that we will need in the sufficient direction of our Theorem
depends upon the choice of $\varepsilon>0$ in the Energy Hypothesis. It is therefore a constant, and
we will simply write $\mathcal{W}$ below.

Proposition 2.8. For $C>1$, we have the inequality

$$\mathcal{W}\le\min\{\mathcal{H},\mathcal{H}^*\}+C'A_2.$$



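Proof. To see this we write

$$\int_JH(1_I\sigma)\,d\omega=\int_{J_L}H(1_I\sigma)\,d\omega+\int_{J_C}H(1_I\sigma)\,d\omega+\int_{J_R}H(1_I\sigma)\,d\omega,$$

where

$$J_L=\{x\in J\setminus I:\ x\text{ lies to the left of }I\},\qquad J_C=J\cap I,\qquad J_R=\{x\in J\setminus I:\ x\text{ lies to the right of }I\}.$$

Now we easily have

$$\left|\int_{J_C}H(1_I\sigma)\,d\omega\right|\le\sqrt{\omega(J_C)}\left(\int_{J_C}|H(1_I\sigma)|^2\,d\omega\right)^{1/2}\le\sqrt{\omega(J)}\left(\int_I|H(1_I\sigma)|^2\,d\omega\right)^{1/2}\le\mathcal{H}\sqrt{\omega(J)\,\sigma(I)}.$$

The two remaining terms are each handled in the same way, so we treat only the first one
$\int_{J_L}H(1_I\sigma)\,d\omega$. We will use Muckenhoupt's characterization of Hardy's inequality [Muc] for
weights $\hat\omega$ and $\sigma$: if $B$ is the best constant in

(2.9) $$\int_0^a\left(\int_0^xf\,d\sigma\right)^2d\hat\omega(x)\le B^2\int_0^af^2\,d\sigma,$$

then,

(2.10) $$B\approx\sup_{0<r<a}\left(\int_r^ad\hat\omega\right)^{1/2}\left(\int_0^rd\sigma\right)^{1/2}.$$

We will give the proof here assuming that $\omega$ and $\sigma$ have no point masses, as the general case
is hard.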

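Without loss of generality we consider the extreme case $J_L=(-a,0)$ and $I=(0,b)$ with
$0<a\le b$. We decompose $I=I_1\cup I_2$ with $I_1=(0,a)$ and $I_2=(a,b)$. First we note the easy
estimate

$$\int_{J_L}H(1_{I_2}\sigma)\,d\omega\le\int_{-a}^{0}\left(\int_a^b\frac1y\,d\sigma(y)\right)d\omega(x)=\omega(J_L)\int_a^b\frac1y\,d\sigma(y)$$
$$\le\omega(J_L)\sqrt{\sigma(I_2)}\left(\int_a^b\left(\frac1y\right)^2d\sigma(y)\right)^{1/2}$$
$$\le\sqrt{\omega(J_L)\,\sigma(I_2)}\sqrt{\frac{\omega(J_L)}{|J_L|}P(I_1,\sigma)}\le2A_2\sqrt{\omega(J_L)\,\sigma(I_2)},$$

since $J_L$ and $I_1$ are touching intervals of equal length $a$. Then we use (2.10) for the other
term:

$$\int_{J_L}H(1_{I_1}\sigma)\,d\omega=\iint_{(-a,0)\times(0,a)}1_{\{-x>y\}}\frac{1}{y-x}\,d\sigma(y)\,d\omega(x)+\iint_{(-a,0)\times(0,a)}1_{\{-x<y\}}\frac{1}{y-x}\,d\sigma(y)\,d\omega(x)\equiv\mathrm{I}+\mathrm{II}.$$

These two terms are symmetric in $\omega$ and $\sigma$ so we consider only the first one, $\mathrm{I}$. We have,
letting $z=-x$ and $d\tilde\omega(z)=d\omega(-z)$ and $d\hat\omega(z)=\frac{1}{z^2}\,d\tilde\omega(z)$,

$$\mathrm{I}=\int_0^a\int_0^z\frac{1}{y+z}\,d\sigma(y)\,d\omega(-z)\le\int_0^a\frac1z\left(\int_0^zd\sigma\right)d\tilde\omega(z)$$
$$\le\left(\int_0^ad\tilde\omega\right)^{1/2}\left(\int_0^a\left(\int_0^zd\sigma\right)^2d\hat\omega(z)\right)^{1/2}$$
$$\le B\sqrt{\int_{-a}^{0}d\omega\times\int_0^ad\sigma}=B\sqrt{\omega(J_L)\,\sigma(I_1)},$$

upon using (2.9) with $f=1$. Finally we obtain $B\lesssim A_2$ from (2.10) and

$$\left(\int_r^ad\hat\omega\right)\left(\int_0^rd\sigma\right)=\left(\int_r^a\frac{1}{z^2}\,d\tilde\omega\right)\times\left(\int_0^rd\sigma\right)\lesssim P((0,r),\tilde\omega)\times\frac1r\int_{(0,r)}d\sigma\lesssim P((0,r),\tilde\omega)\,P((0,r),\sigma)\lesssim A_2^2.$$

Thus we have proved $\mathcal{W}\le\mathcal{H}+CA_2$. We obtain $\mathcal{W}\le\mathcal{H}^*+CA_2$ by applying the above
reasoning to

$$\int_JH(1_I\sigma)\,d\omega=\int_IH(1_J\omega)\,d\sigma.$$

□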



2.3. The Energy Condition. We show here that the Energy Conditions are implied by the
$A_2$ and testing conditions.

Proposition 2.11. We have the inequality $\mathcal{E}\lesssim A_2+\mathcal{H}$, and similarly for $\mathcal{E}^*$.

The Energy Hypotheses with $\gamma>0$ are the essential tools in organizing the sufficient proof.
The proof begins with this Lemma.

Lemma 2.12. For any interval $I$ and any positive measure $\nu$ supported in $\mathbb{R}\setminus I$, we have

(2.13) $$P(I;\nu)\le2|I|\inf_{x,y\in I}\frac{H\nu(x)-H\nu(y)}{x-y}.$$

For specificity, in this section, we are re-defining the Poisson integral to be

(2.14) $$P(I;\nu)\equiv\frac{\nu(I)}{|I|}+\frac{|I|}{2}\int_{\mathbb{R}\setminus I}\frac{1}{|z-z_I|^2}\,\nu(dz)$$

with $z_I$ the center of $I$. Note that this definition of $P(I;\nu)$ is comparable to that in (1.5).
Note that $H(1_{I^c}\nu)$ is increasing on $I$ when $\nu$ is positive, so that the infimum in (2.13) is
nonnegative.

Proof. To see (2.13), we suppose without loss of generality that $I=(-a,a)$, and a calculation
then shows that for $-a<x<y<a$,

$$H\nu(y)-H\nu(x)=\int_{\mathbb{R}\setminus I}\left(\frac{1}{z-y}-\frac{1}{z-x}\right)\nu(dz)=(y-x)\int_{\mathbb{R}\setminus I}\frac{1}{(z-y)(z-x)}\,\nu(dz)\ge\frac14(y-x)\int_{\mathbb{R}\setminus I}\frac{1}{z^2}\,\nu(dz),$$

since $(z-y)(z-x)$ is positive and satisfies

$$\frac{1}{(z-y)(z-x)}\ge\frac{1}{4z^2}$$

on each interval $(-\infty,-a)$ and $(a,\infty)$ in $\mathbb{R}\setminus I$ when $-a<x<y<a$. Thus we have from
(2.14), and the assumption about the support of $\nu$,

$$P(I;\nu)=\frac{|I|}{2}\int_{\mathbb{R}\setminus I}\frac{1}{z^2}\,\nu(dz)\le2|I|\inf_{x,y\in I}\frac{H\nu(y)-H\nu(x)}{y-x}.$$

□

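Proof of Proposition 2.11. We recall the energy condition in (1.9). Fix an interval $I_0$, and
pairwise disjoint strict subintervals $\{I_r : r\ge1\}$. Let $\{I_s : s\ge1\}$ be pairwise disjoint
subintervals of $I_r$.

We apply (2.13), so that for $x,y\in I_r$, we have

$$\frac{|y-x|}{|I_r|}\,P(I_r;1_{I_0}\sigma)\le\frac{\sigma(I_r)}{|I_r|}\cdot\frac{|y-x|}{|I_r|}+2\left|H(1_{I_0\cap I_r^c}\sigma)(y)-H(1_{I_0\cap I_r^c}\sigma)(x)\right|.$$

Let us for the moment assume that the second term on the right is dominant. Squaring the
inequality above, averaging with respect to the measure $\omega$ in both $x$ and $y$, we obtain

$$E(I_r,\omega)^2\,P(I_r;1_{I_0}\sigma)^2\lesssim E_{I_r}^{\omega(dx)}E_{I_r}^{\omega(dy)}\left(\frac{y-x}{|I_r|}\right)^2P(I_r;1_{I_0}\sigma)^2$$
$$\lesssim E_{I_r}^{\omega(dx)}E_{I_r}^{\omega(dy)}\left|H(1_{I_0\cap I_r^c}\sigma)(y)-H(1_{I_0\cap I_r^c}\sigma)(x)\right|^2$$
$$\le4\,E_{I_r}^{\omega}\left|H(1_{I_0\cap I_r^c}\sigma)\right|^2.$$

Multiply the last inequality by $\omega(I_r)$ and sum in $r$ to get

$$\sum_{r\ge1}\omega(I_r)\,E(I_r,\omega)^2\,P(I_r;1_{I_0}\sigma)^2\lesssim\sum_r\int_{I_r}\left|H(1_{I_0\cap I_r^c}\sigma)\right|^2d\omega$$
$$\lesssim\int_{I_0}|H(1_{I_0}\sigma)|^2\,d\omega+C\sum_r\int_{I_r}|H(1_{I_r}\sigma)|^2\,d\omega$$
$$\lesssim\mathcal{H}^2\sigma(I_0)$$

by the testing condition (1.3) applied to both $I_0$ and $I_r$.

Returning to (2.15), it remains to consider the case where the first term on the right is
dominant. By the same reasoning, we arrive at

$$E(I_r,\omega)^2\,P(I_r;1_{I_0}\sigma)^2\lesssim E_{I_r}^{\omega(dx)}E_{I_r}^{\omega(dy)}\,\frac{\sigma(I_r)^2}{|I_r|^2}\left(\frac{|y-x|}{|I_r|}\right)^2\le\frac{\sigma(I_r)^2}{|I_r|^2}.$$

Multiply the last inequality by $\omega(I_r)$ and sum in $r$ to get

$$\sum_{r\ge1}\omega(I_r)\,E(I_r,\omega)^2\,P(I_r;1_{I_0}\sigma)^2\lesssim\sum_{r=1}^{\infty}\frac{\sigma(I_r)^2}{|I_r|^2}\,\omega(I_r)\le A_2^2\sum_{r=1}^{\infty}\sigma(I_r)\le A_2^2\,\sigma(I_0).$$

□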



Remark 2.15. We refer to $E(I,\omega)$ as the energy functional because in dimension $n\ge3$ the
integral

$$\iint|x-x'|^{2-n}\,d\omega(x)\,d\omega(x')$$

represents the energy required to compress charge from infinity to a distribution $\omega$ on $I$,
assuming a repulsive inverse square law force. When $n=1$, the force is attractive and the
integral

$$\iint|x-x'|\,d\omega(x)\,d\omega(x')$$

represents the energy required to disperse charge from a point to a distribution $\omega$ on $I$.

2.4. The Hybrid Condition. We begin with a monotonicity property of energy, and then
apply it to show that the Hybrid Condition implies the Energy Hypothesis. This Lemma
helps clarify the role of the Hybrid Conditions.

Lemma 2.16. Fix $0\le\epsilon\le2$. Let $I_0$ be an interval, and $\{I_r : r\ge1\}$ a partition of $I_0$. We
have the inequalities for $0\le\varepsilon<1-\frac{\epsilon}{2}$:

$$\sum_{r\ge1}\omega(I_r)\,|I_r|^{\epsilon}\,E(I_r;\omega)^{\epsilon}\le\omega(I_0)\,|I_0|^{\epsilon}\,E(I_0;\omega)^{\epsilon},$$
$$\sum_{r\ge1}\omega(I_r)\,|I_r|^{2-2\varepsilon}\,E(I_r;\omega)^{\epsilon}\le\sup_{r\ge1}\left(\frac{|I_r|}{|I_0|}\right)^{2-2\varepsilon-\epsilon}\omega(I_0)\,|I_0|^{2-2\varepsilon}\,E(I_0;\omega)^{\epsilon}.$$

The second inequality is obvious given the first; as it turns out this is the basic fact used
to exploit the Hybrid Conditions for $0\le\epsilon<2$, so we have stated it explicitly.

Proof. The inequality is obvious for $\epsilon=0$. We prove it for $\epsilon=2$. This is rather clear if we
make the definition

$$\mathrm{Var}_I^{\omega}\equiv\omega(I)\,E_I^{\omega}\left(x-E_I^{\omega}x\right)^2.$$

Then, we have $\omega(I)\,|I|^2E(I;\omega)^2=\mathrm{Var}_I^{\omega}$.

Second, the variance of a random variable $Z$ is the squared $L^2$-distance of $Z$ from the
linear space of constants. And $\omega(I_0)\,|I_0|^2E(I_0;\omega)^2$ admits a transparent reformulation in this
language: the random variable is $x$ and the probability measure is normalized $\omega$ measure.
Moreover,

$$\sum_{r\ge1}\omega(I_r)\,|I_r|^2E(I_r;\omega)^2$$

is the squared $L^2$-distance of $x$ to the space of functions piecewise constant on the intervals
of the partition $\{I_r : r\ge1\}$. Hence, the inequality above is immediate.

For the case of $0<\epsilon<2$, we apply Hölder's inequality and appeal to the case of $\epsilon=2$:

$$\left(\sum_{r\ge1}\omega(I_r)\,|I_r|^{\epsilon}E(I_r;\omega)^{\epsilon}\right)^{1/\epsilon}\le\omega(I_0)^{(2-\epsilon)/2\epsilon}\left(\sum_{r\ge1}\omega(I_r)\,|I_r|^2E(I_r;\omega)^2\right)^{1/2}$$
$$\le\omega(I_0)^{(2-\epsilon)/2\epsilon}\,\omega(I_0)^{1/2}\,|I_0|\,E(I_0,\omega),$$

which is the claimed inequality. □
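The first inequality of Lemma 2.16 can be tested numerically for a discrete measure, using the identity $\omega(I)|I|^{\epsilon}E(I;\omega)^{\epsilon}=\omega(I)^{1-\epsilon/2}(\mathrm{Var}_I^{\omega})^{\epsilon/2}$ that follows from $\omega(I)|I|^2E(I;\omega)^2=\mathrm{Var}_I^{\omega}$. The Python sketch below is an illustration under assumptions: the function names, the half-open intervals, and the `(point, mass)` encoding are all choices made here.

```python
def mass_and_variance(interval, atoms):
    """Return (w(I), Var_I^w), where Var_I^w = sum_j m_j (x_j - mean)^2 over atoms in [a, b)."""
    a, b = interval
    inside = [(x, m) for x, m in atoms if a <= x < b]
    total = sum(m for _, m in inside)
    if total == 0:
        return 0.0, 0.0
    mean = sum(m * x for x, m in inside) / total
    var = sum(m * (x - mean) ** 2 for x, m in inside)
    return total, var


def hybrid_term(interval, atoms, eps):
    """w(I) |I|^eps E(I; w)^eps, rewritten as w(I)^(1 - eps/2) * Var^(eps/2)."""
    total, var = mass_and_variance(interval, atoms)
    if total == 0:
        return 0.0
    return total ** (1.0 - eps / 2.0) * var ** (eps / 2.0)


def monotonicity_holds(whole, parts, atoms, eps, tol=1e-12):
    """Check sum_r term(I_r) <= term(I_0) for a partition `parts` of `whole`."""
    lhs = sum(hybrid_term(part, atoms, eps) for part in parts)
    return lhs <= hybrid_term(whole, atoms, eps) + tol
```

For $\epsilon=2$ this is the statement that the within-piece variation never exceeds the total variation, and for $\epsilon=0$ it reduces to additivity of $\omega$.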

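Here is a Poisson inequality for good intervals that will see service both here and later in
the paper.

Lemma 2.17. Suppose that $J\subset I_0\subset I$ and that $\operatorname{dist}(J,e(I_0))\ge\frac12|J|^{\varepsilon}|I_0|^{1-\varepsilon}$. Then

(2.18) $$|J|^{2\varepsilon-2}\,P(J,\sigma1_{I\setminus I_0})^2\lesssim|I_0|^{2\varepsilon-2}\,P(I_0,\sigma1_{I\setminus I_0})^2.$$

Proof. We have

$$P(J,\sigma1_{I\setminus I_0})\approx\sum_{k=0}^{\infty}2^{-k}\frac{1}{|2^kJ|}\int_{(2^kJ)\cap(I\setminus I_0)}d\sigma,$$

and $(2^kJ)\cap(I\setminus I_0)\neq\emptyset$ requires

$$\operatorname{dist}(J,e(I_0))\le|2^kJ|.$$

By our distance assumption we must then have

$$\frac12|J|^{\varepsilon}|I_0|^{1-\varepsilon}\le\operatorname{dist}(J,e(I_0))\le2^k|J|,$$

or

$$2^{-k}\le2\left(\frac{|J|}{|I_0|}\right)^{1-\varepsilon}.$$

Thus we have

$$P(J,\sigma1_{I\setminus I_0})\lesssim\left(\frac{|J|}{|I_0|}\right)^{1-\varepsilon}P(I_0,\sigma1_{I\setminus I_0}),$$

which is the inequality (2.18). □

We can now obtain that the Hybrid Condition implies the Energy Hypothesis.

Lemma 2.19. Let $0\le\epsilon<2$. Then the functional

$$\Psi(J,E)\equiv\omega(J)\,E(J,\omega)^{\epsilon}\,P(J,1_E\sigma)^2$$

satisfies the three properties in (1.14) with $0\le\varepsilon<1-\frac{\epsilon}{2}$. As a consequence, the Hybrid
Condition (1.10) implies the Energy Hypothesis (1.16) with $\gamma=2-2\varepsilon-\epsilon$.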

Proof. The first line in (1.14) is the usual A 2 condition, and the second line is the Hybrid 
Condition (1.10) with e = 7. Thus we must show the third line: 



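$$\sum_{r\ge1}\Phi(J_r,I\setminus I_0)\le\sup_{r\ge1}\left(\frac{|J_r|}{|I_0|}\right)^{\gamma}\Psi(I_0,I\setminus I_0)$$

for all $\varepsilon$-good subpartitions $\{J_r\}$ of $I_0$, i.e. those satisfying (1.13). From Lemma 2.17 we have

$$|J_r|^{2\varepsilon-2}\,P(J_r,\sigma1_{I\setminus I_0})^2\lesssim|I_0|^{2\varepsilon-2}\,P(I_0,\sigma1_{I\setminus I_0})^2.$$

Now use Lemma 2.16 and $E(J_r,\omega)^2\le E(J_r,\omega)^{\epsilon}$ to obtain

$$\sum_{r\ge1}\Phi(J_r,I\setminus I_0)=\sum_{r\ge1}\omega(J_r)\,|J_r|^{2-2\varepsilon}E(J_r,\omega)^2\,|J_r|^{2\varepsilon-2}P(J_r,\sigma1_{I\setminus I_0})^2$$
$$\lesssim\sum_{r\ge1}\omega(J_r)\,|J_r|^{2-2\varepsilon}E(J_r,\omega)^{\epsilon}\,|I_0|^{2\varepsilon-2}P(I_0,\sigma1_{I\setminus I_0})^2$$
$$\le\sup_{r\ge1}\left(\frac{|J_r|}{|I_0|}\right)^{2-2\varepsilon-\epsilon}\times\omega(I_0)\,E(I_0;\omega)^{\epsilon}\,P(I_0,\sigma1_{I\setminus I_0})^2$$
$$=\sup_{r\ge1}\left(\frac{|J_r|}{|I_0|}\right)^{\gamma}\Psi(I_0,I\setminus I_0).$$

□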



3. Grids, Haar Function, Carleson Embedding 

This section collects some standard facts which can be found e.g. in [Vol]. We call a
collection of intervals $\mathcal{G}$ a grid iff for all $I,J\in\mathcal{G}$ we have $I\cap J\in\{\emptyset,I,J\}$. An interval
$I\in\mathcal{G}$ may have a parent $I^{(1)}$: the unique minimal interval $J\in\mathcal{G}$ that strictly contains
$I$. Recursively define $I^{(j+1)}=(I^{(j)})^{(1)}$. In the analysis of the paper, it will be necessary to
distinguish the grid in question when passing to a parent. We accordingly set

(3.1) $\pi_{\mathcal{G}}(I)\equiv$ the unique minimal interval $J\in\mathcal{G}$ that strictly contains $I$.

Recursively set $\pi_{\mathcal{G}}^{j+1}(I)=\pi_{\mathcal{G}}(\pi_{\mathcal{G}}^{j}(I))$. Note that the definition of $\pi_{\mathcal{G}}(I)$ makes sense even if
$I\notin\mathcal{G}$.

A grid $\mathcal{D}$ is dyadic if each interval $I\in\mathcal{D}$ is the union of $I_-,I_+\in\mathcal{D}$, with $I_-$ being the left-half
of $I$, and likewise for $I_+$. We will refer to $I_\pm$ as the children of $I$.

A dyadic grid $\mathcal{D}$, with weight $\sigma$, admits the Haar basis adapted to $\sigma$ and $\mathcal{D}$. This basis is
especially nice if the weight $\sigma$ does not assign positive mass to any endpoint of an interval in
$\mathcal{D}$. This can be achieved by e.g. a joint translation of the intervals in $\mathcal{D}$, and so it will be a
standing assumption.

The Haar basis $\{h_I^{\sigma} : I\in\mathcal{D}\}$ is explicitly defined to be

$$h_I^{\sigma}\equiv\frac{-\sigma(I_+)1_{I_-}+\sigma(I_-)1_{I_+}}{\left[\sigma(I_+)^2\sigma(I_-)+\sigma(I_-)^2\sigma(I_+)\right]^{1/2}}=\sqrt{\frac{\sigma(I_-)\,\sigma(I_+)}{\sigma(I)}}\left(-\frac{1_{I_-}}{\sigma(I_-)}+\frac{1_{I_+}}{\sigma(I_+)}\right),$$

with the convention that $h_I^{\sigma}\equiv0$ if the restriction of $\sigma$ to either child $I_-$ or $I_+$ vanishes. The
martingale properties of the Haar function are decisive; still, at a couple of points, we have
recourse to the formula

(3 - 2) iwi-^S^vS- 

These functions are (1) pairwise orthogonal, (2) have $\sigma$-integral zero, (3) have $L^2(\sigma)$-norm
either 0 or 1, and (4) form a basis for $L^2(\sigma)$. We also define

$$\Delta_I^{\sigma}f\equiv\langle f,h_I^{\sigma}\rangle_{\sigma}\cdot h_I^{\sigma},$$

where by $\langle\cdot,\cdot\rangle_{\sigma}$ we mean the natural inner product on $L^2(\sigma)$. We then have the $L^2(\sigma)$ identity

(3.3) $$f=\sum_{I\in\mathcal{D}}\Delta_I^{\sigma}f$$

for all $f\in L^2(\sigma)$ that are supported in a dyadic interval $I^0$ and satisfy $\int_{I^0}f\,d\sigma=0$. We
remark that by a simple reduction in (17.3) of [Vol], we only need (3.3) for such $f$ in the
proof of our theorem.
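The listed properties of the weighted Haar functions, and the expansion (3.3), can be verified by hand for a weight made of a few point masses. In this Python sketch a function is stored by its values at the atoms of $\sigma$; the names `haar` and `inner`, the half-open children, and the sample weight are assumptions introduced for illustration.

```python
import math


def haar(interval, atoms):
    """Values of h_I^sigma at the atoms of sigma, for I = [a, b) with children split at the midpoint."""
    a, b = interval
    mid = 0.5 * (a + b)
    left = sum(m for x, m in atoms if a <= x < mid)    # sigma(I_-)
    right = sum(m for x, m in atoms if mid <= x < b)   # sigma(I_+)
    if left == 0 or right == 0:
        # Convention: h_I^sigma = 0 if sigma vanishes on either child.
        return [0.0] * len(atoms)
    scale = math.sqrt(left * right / (left + right))
    values = []
    for x, _ in atoms:
        if a <= x < mid:
            values.append(-scale / left)
        elif mid <= x < b:
            values.append(scale / right)
        else:
            values.append(0.0)
    return values


def inner(f, g, atoms):
    """Inner product of f and g in L^2(sigma), with f and g given by values at the atoms."""
    return sum(m * fv * gv for (_, m), fv, gv in zip(atoms, f, g))
```

With four atoms in $[0,1)$ and the three dyadic intervals $[0,1)$, $[0,\frac12)$, $[\frac12,1)$, the Haar functions form an orthonormal basis of the $\sigma$-mean-zero functions, so the expansion (3.3) is exact.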

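Note that (3.3) yields the Plancherel formula

(3.4) $$\|f\|_{L^2(\sigma)}^2=\sum_{I\in\mathcal{D}}|\langle f,h_I^{\sigma}\rangle_{\sigma}|^2,\qquad f\in L^2(\sigma),\ \operatorname{supp}f\subset I^0,\ \int_{I^0}f\,d\sigma=0.$$

The following simple identities are basic as well. We have

(3.5) $$\Delta_I^{\sigma}f=\left\{1_{I_+}E_{I_+}^{\sigma}f+1_{I_-}E_{I_-}^{\sigma}f\right\}-1_IE_I^{\sigma}f.$$

Consequently for two intervals $I_1\subsetneq I_2$, $I_1,I_2\in\mathcal{D}$, the sum below is telescoping, so easily
summable:

(3.6) $$\sum_{I:\ I_1\subsetneq I\subset I_2}\Delta_I^{\sigma}f(x)=E_{I_1}^{\sigma}f-E_{I_2}^{\sigma}f,\qquad x\in I_1.$$

In these displays, we are using the notation

$$E_I^{\sigma}\phi\equiv\sigma(I)^{-1}\int_I\phi\,\sigma(dx),$$

thus, $E_I^{\sigma}f$ is the average value of $f$ with respect to the weight $\sigma$ on the interval $I$.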

We turn to a brief description of paraproducts. The familiar Carleson Embedding Theorem
is fundamental for the proof. The proof follows classical lines, using that the map $f\to E_I^{\sigma}f$
is type $(\infty,\infty)$ and also weak type $(1,1)$ with respect to the measure $\sum_{I\in\mathcal{D}}a_I\delta_I$ on $\mathcal{D}$ by the
Carleson condition (3.9).

Theorem 3.7. Fix a weight $\sigma$ and consider nonnegative constants $\{a_I : I\in\mathcal{D}\}$. The
following two inequalities are equivalent:

(3.8) $$\sum_{I\in\mathcal{D}}a_I\,|E_I^{\sigma}f|^2\le C_1\|f\|_{L^2(\sigma)}^2,$$

(3.9) $$\sum_{I\in\mathcal{D}:\ I\subset J}a_I\le C_2\,\sigma(J),\qquad J\in\mathcal{D}.$$

Taking $C_1$ and $C_2$ to be the best constants in these inequalities, we have $C_1\approx C_2$ with the
implied constant independent of $\sigma$.

There is another language commonly associated with the Carleson Embedding Theorem.
For the purposes of this discussion, let $\hat I=I\times[0,|I|]$ be the square in the upper half-plane
$\mathbb{R}^2_+$ with face $I$ on the real line, viewed as the boundary of $\mathbb{R}^2_+$. This is called the box over $I$.
And consider the linear map, a $\sigma$-weighted analog of the Poisson integral,

$$Af(x,t)\equiv E_{(x-t/2,\,x+t/2)}^{\sigma}f.$$

Given a measure $\mu$ on $\mathbb{R}^2_+$, this operator maps $L^2(\mathbb{R},\sigma)$ into $L^2(\mathbb{R}^2_+,\mu)$ if and only if the
measure $\mu$ satisfies the Carleson measure condition

$$\mu(\hat I)\le C\,\sigma(I)\quad\text{for all intervals }I.$$

The condition (3.9) above is a discrete analog of this condition, with the measure $\mu$ being
defined by a sum of Dirac point masses at the center of the tops of the boxes over $I$:

$$\mu\equiv\sum_{I\in\mathcal{D}}a_I\,\delta_{(c_I,|I|)}.$$
lev 

In seeking to verify the Carleson measure condition (3.9), there is a store of common
reductions. A very simple one is that it suffices to test (3.9) for dyadic intervals $J$ for which
$a_J\ne0$.

A slightly more complicated one is this: Let $\mathcal S\subset\mathcal D$ be such that we have the estimate

(3.10) $$\sum_{S\in\mathcal S\,:\,S\subset S_0}\sigma(S)\le C_1\,\sigma(S_0), \qquad S_0\in\mathcal S .$$

That is, the measure $\sigma$ has the Carleson measure property, provided one only sums intervals
in $\mathcal S$. Now, suppose that $\mu$ is a measure on $\mathbb R^2_+$ such that for any $S_0\in\mathcal S$, we have

(3.11) $$\mu\Big(\widehat S_0\setminus\bigcup_{S\in\mathcal S\,:\,S\subsetneq S_0}\widehat S\Big)\le C_2\,\sigma(S_0) .$$

Here, we have the box over $S_0$, and we remove the smaller boxes. Then, we have $\mu(\widehat S)\le
(C_1+C_2)\,\sigma(S)$ for all $S\in\mathcal S$. We shall implicitly use this reduction.

In the two weight setting, a paraproduct would be, for example, an operator of the form

$$Tf = \sum_{I\in\mathcal D} a_I\,\mathbb E^\sigma_I f\cdot h^\omega_I .$$

By the orthogonality of the Haar system, it follows that $T$ maps $L^2(\sigma)$ into $L^2(\omega)$ if and only
if the sequence of square coefficients $\{a_I^2 : I\in\mathcal D\}$ satisfies the condition (3.9). This is the
type of argument we will be appealing to below.
4. The Good-Bad Decomposition

Here we follow the random grid idea of Nazarov, Treil and Volberg as set out for example
in Chapter 17 of [Vol]. The first step in the proof is to obtain two grids, one for each
weight, that work well with each other. There are in fact many dyadic grids in $\mathbb R$. For any
$\beta=\{\beta_l\}\in\{0,1\}^{\mathbb Z}$, define the dyadic grid $\mathbb D_\beta$ to be the collection of intervals

$$\mathbb D_\beta = \Big\{2^n\Big([0,1)+k+\sum_{i<n}2^{i-n}\beta_i\Big)\Big\}_{n\in\mathbb Z,\ k\in\mathbb Z} .$$

This parametrization of dyadic grids appears explicitly in [Hyt], and implicitly in [NTV2]
section 9.1. Place the usual uniform probability measure $\mathbb P$ on the space $\{0,1\}^{\mathbb Z}$, explicitly

$$\mathbb P(\beta:\beta_l=0)=\mathbb P(\beta:\beta_l=1)=\tfrac12, \qquad\text{for all } l\in\mathbb Z,$$

and then extend by independence of the $\beta_l$. Note that the endpoints and centers of the
intervals in the grid $\mathbb D_\beta$ are contained in $\mathbb Q^{dy}+x_\beta$, the dyadic rationals $\mathbb Q^{dy}=\{m/2^n\}_{m,n\in\mathbb Z}$
translated by $x_\beta=\sum_{i<0}2^i\beta_i\in[0,1]$. Moreover the pushforward of the probability measure
$\mathbb P$ under the map $\beta\to x_\beta$ is Lebesgue measure on $[0,1]$. The locally finite weights $\omega$, $\sigma$ have
at most countably many point masses, and it follows with probability one that $\omega$, $\sigma$ do not
charge an endpoint or center of any interval in $\mathbb D_\beta$.

For a weight $\omega$, we consider a random choice of dyadic grid $\mathcal D^\omega$ on the probability space $\Sigma^\omega$,
and likewise for the second weight $\sigma$, with a random choice of dyadic grid $\mathcal D^\sigma$ on the probability
space $\Sigma^\sigma$.

Notation 4.1. We fix $\varepsilon>0$ for use throughout the remainder of the paper.

Definition 4.2. For a positive integer $r$, an interval $J\in\mathcal D^\sigma$ is said to be $r$-bad if there is
an interval $I\in\mathcal D^\omega$ with $|I|\ge2^r|J|$, and

$$\operatorname{dist}(e(I),J)\le\tfrac12|J|^\varepsilon|I|^{1-\varepsilon} .$$

Here, $e(I)$ is the set of three points consisting of the two endpoints of $I$ and its center. (This
is the set of discontinuities of $h_I$.) Otherwise, $J$ is said to be $r$-good. We symmetrically
define $J\in\mathcal D^\omega$ to be $r$-good.

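The basic proposition here is:

Proposition 4.3. Fix a grid $\mathcal D^\omega$ and $J\in\mathcal D^\omega$. Then $\mathbb P(J\text{ is }r\text{-bad})\le C2^{-\varepsilon r}$.

Proof. Let $I\in\mathcal D^\sigma$ with the same length as $J$ and $|I\cap J|\ge\frac12|J|$. Let $s=\lfloor(1-\varepsilon)r\rfloor$ and
consider the $s$-fold ancestor $\pi^s_{\mathcal D^\sigma}I$ of $I$ in the grid $\mathcal D^\sigma$. We have

$$\operatorname{dist}(e(\pi^s_{\mathcal D^\sigma}I),J)\le2^s|J|\le|J|^\varepsilon|\pi^r_{\mathcal D^\sigma}I|^{1-\varepsilon} .$$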
In order that

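(4.4) $$\operatorname{dist}(e(\pi^r_{\mathcal D^\sigma}I),J)\le\tfrac12|J|^\varepsilon|\pi^r_{\mathcal D^\sigma}I|^{1-\varepsilon} ,$$

it would then be required that all of the further ancestors of $I$ up to $\pi^r_{\mathcal D^\sigma}I$, namely $\pi^{s+1}_{\mathcal D^\sigma}I,\dots,\pi^r_{\mathcal D^\sigma}I$,
must share a common endpoint. Indeed, if not there is $1\le\ell\le r-s$ such that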

dist(e(7r^J), J) > dist(e(4+'J), J)>\ = 2 s+e - 1 \I\ 

> 2 S+1 1/| > \J\ £ 2 (1 - £)r > \ J\ £ Itt^/I 1 ^ . 

The essential point about the random construction of the grids used here is that for any
interval $K$, $K$ is equally likely to be the left or right half of its parent, and the selection
of parents is done independently. But sharing a common endpoint means that $\pi^t_{\mathcal D^\sigma}I$ has to
be the left-half, say, of $\pi^{t+1}_{\mathcal D^\sigma}I$, for all $t=s,\dots,r-1$. So the probability that (4.4) holds
is at most $2^{-r+s+2}\le2^{-\varepsilon r+2}$. Now by definition, $J$ is $r$-bad if at least one of the ancestors
$\{\pi^{r+k}_{\mathcal D^\sigma}I\}_{k=0}^\infty$ at or beyond $\pi^r_{\mathcal D^\sigma}I$ satisfies

(4.5) $$\operatorname{dist}(e(\pi^{r+k}_{\mathcal D^\sigma}I),J)\le\tfrac12|J|^\varepsilon|\pi^{r+k}_{\mathcal D^\sigma}I|^{1-\varepsilon} .$$

The argument above shows that the probability that (4.5) holds is at most $2^{-\varepsilon(r+k)+2}$. It
follows that the probability that $J$ is $r$-bad is at most

$$\sum_{k=0}^\infty2^{-\varepsilon(r+k)+2}=2^{-\varepsilon r+2}\sum_{k=0}^\infty2^{-\varepsilon k}=\frac{2^{-\varepsilon r+2}}{1-2^{-\varepsilon}}\le C_\varepsilon2^{-\varepsilon r} ,$$

which proves the Proposition. □
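The final geometric series makes the constant explicit: one may take $C_\varepsilon=4/(1-2^{-\varepsilon})$. The short sketch below (function names are hypothetical, chosen for this illustration) evaluates the partial sums and the closed form, which also shows how large $r$ must be before the bound drops below 1.

```python
def bad_probability_bound(eps, r, terms=10_000):
    """Partial sum of sum_{k>=0} 2^{-eps*(r+k)+2}, the bound on P(J is r-bad)."""
    return sum(2.0 ** (-eps * (r + k) + 2) for k in range(terms))

def closed_form(eps, r):
    """Closed form 2^{-eps*r+2} / (1 - 2^{-eps}) of the geometric series."""
    return 2.0 ** (-eps * r + 2) / (1.0 - 2.0 ** (-eps))

def constant(eps):
    """An admissible C_eps in the bound C_eps * 2^{-eps*r}."""
    return 4.0 / (1.0 - 2.0 ** (-eps))

def smallest_useful_r(eps):
    """Smallest positive integer r for which the bound is below 1 (nontrivial as a probability)."""
    r = 1
    while closed_form(eps, r) >= 1.0:
        r += 1
    return r
```

The bound only carries information once $r$ exceeds roughly $\varepsilon^{-1}\log_2 C_\varepsilon$, which is why $r$ is taken large depending on $\varepsilon$.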

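We restate the previous Proposition in a new setting. Let $\mathcal D^\sigma$ be randomly selected, with
parameter $\beta$, and $\mathcal D^\omega$ with parameter $\beta'$. Define a projection

$$P^\sigma_{\mathrm{good}}f=\sum_{I\text{ is }r\text{-good}\,\in\,\mathcal D^\sigma}\Delta^\sigma_If ,$$

and likewise for $P^\omega_{\mathrm{good}}\phi$. We define $P^\sigma_{\mathrm{bad}}f=f-P^\sigma_{\mathrm{good}}f$. The basic Proposition is:

Proposition 4.6. (Theorem 17.1 in [Vol].) We have the estimates

$$\mathbb E_{\beta'}\|P^\sigma_{\mathrm{bad}}f\|_{L^2(\sigma)}\le C2^{-\varepsilon r/2}\|f\|_{L^2(\sigma)} ,$$

and likewise for $P^\omega_{\mathrm{bad}}$.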
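Proof. By Proposition 4.3 and the Plancherel formula (3.4), we have

$$\mathbb E_{\beta'}\|P^\sigma_{\mathrm{bad}}f\|^2_{L^2(\sigma)}=\mathbb E_{\beta'}\sum_{I\text{ is }r\text{-bad}}\langle f,h^\sigma_I\rangle^2_\sigma\le C2^{-\varepsilon r}\sum_{I\in\mathcal D^\sigma}\langle f,h^\sigma_I\rangle^2_\sigma\le C2^{-\varepsilon r}\|f\|^2_{L^2(\sigma)} .$$

□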

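From this we conclude the following: There is an absolute choice of $r$ so that the following
holds. Let $T:L^2(\sigma)\to L^2(\omega)$ be a bounded linear operator. We then have

(4.7) $$\|T\|_{L^2(\sigma)\to L^2(\omega)}\le2\sup_{\|f\|_{L^2(\sigma)}=1}\ \sup_{\|\phi\|_{L^2(\omega)}=1}\mathbb E_\beta\mathbb E_{\beta'}\big|\langle TP^\sigma_{\mathrm{good}}f,P^\omega_{\mathrm{good}}\phi\rangle_\omega\big| .$$

Indeed, we can choose $f\in L^2(\sigma)$ of norm one, and $\phi\in L^2(\omega)$ of norm one, and we can write

$$f=P^\sigma_{\mathrm{good}}f+P^\sigma_{\mathrm{bad}}f$$

and similarly for $\phi$, so that

$$\|T\|_{L^2(\sigma)\to L^2(\omega)}=\langle Tf,\phi\rangle_\omega\le\mathbb E_\beta\mathbb E_{\beta'}\big|\langle TP^\sigma_{\mathrm{good}}f,P^\omega_{\mathrm{good}}\phi\rangle_\omega\big|+\mathbb E_\beta\mathbb E_{\beta'}\big|\langle TP^\sigma_{\mathrm{bad}}f,P^\omega_{\mathrm{good}}\phi\rangle_\omega\big|$$
$$\qquad+\ \mathbb E_\beta\mathbb E_{\beta'}\big|\langle TP^\sigma_{\mathrm{good}}f,P^\omega_{\mathrm{bad}}\phi\rangle_\omega\big|+\mathbb E_\beta\mathbb E_{\beta'}\big|\langle TP^\sigma_{\mathrm{bad}}f,P^\omega_{\mathrm{bad}}\phi\rangle_\omega\big|$$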

< E^|(TP^ ood /, P£ ood 0)J + 3C ■ 2-/ 16 ||T|| l2(ct) ^ l2m . 

And this proves (4.7) for r sufficiently large depending on e > 0. 

This has the following implication for us: It suffices to consider only $r$-good intervals, and
prove an estimate for $\|H(\sigma\,\cdot)\|_{L^2(\sigma)\to L^2(\omega)}$ that is independent of this assumption. Accordingly,
we will call $r$-good intervals just good intervals from now on.

5. Main Decomposition 

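Fix (large) intervals $I^0\in\mathcal D^\sigma$ and $J^0\in\mathcal D^\omega$, and consider the following modification of the
'good' projections

$$P^\sigma_{\mathrm{good},I^0}f=\sum_{I\in\mathcal D^\sigma,\ I\subset I^0\text{ and }|I|\le2^{-r}|I^0|,\ I\text{ good}}\Delta^\sigma_If .$$

Likewise, define $P^\omega_{\mathrm{good},J^0}$ as above. We will prove that

(5.1) $$\big|\langle H(\sigma P^\sigma_{\mathrm{good},I^0}f),P^\omega_{\mathrm{good},J^0}\phi\rangle_\omega\big|\lesssim\max\{\mathcal A_2,\mathcal H,\mathcal H^*,\mathcal F_{\gamma,\varepsilon},\mathcal F^*_{\gamma,\varepsilon}\}\,\|f\|_{L^2(\sigma)}\|\phi\|_{L^2(\omega)} .$$

As this estimate will hold for all $I^0$, $J^0$, and all joint shifts of $\mathcal D^\sigma$ and $\mathcal D^\omega$ that avoid point
masses at the boundary of intervals, this is sufficient to derive the Main Theorem 1.17. We
also use here that the constant terms associated with the initial intervals $I^0$ and $J^0$ in the
expansion of $f$ and $\phi$ respectively can be handled by the weak boundedness condition (2.8).
This means we can assume the expectations $\mathbb E^\sigma_{I^0}f$ and $\mathbb E^\omega_{J^0}\phi$ both vanish.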

We may assume that $P^\sigma_{\mathrm{good},I^0}f=f$, and likewise for $\phi$. From this point forward, we
will only consider good intervals $I$, $J$. We suppress this dependence in the notation and
we will clearly note the use of this hypothesis when it arises. Similarly we will only consider
intervals $I$, $J$ that contribute to the definition of the projections $P^\sigma_{\mathrm{good},I^0}$ and $P^\omega_{\mathrm{good},J^0}$, and
suppress this fact in the notation. The role of $I^0$ and $J^0$ is to permit the recursive constructions
of the stopping intervals in Definition 6.8 below.

Figure 1. The flow chart of the decomposition of the inner product $\langle H(\sigma f),\phi\rangle_\omega$.
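Now, the inner product in (5.1) is

$$\langle H(\sigma f),\phi\rangle_\omega=\sum_{I\in\mathcal D^\sigma}\sum_{J\in\mathcal D^\omega}\langle H(\sigma\Delta^\sigma_If),\Delta^\omega_J\phi\rangle_\omega=\sum_{I\in\mathcal D^\sigma}\sum_{J\in\mathcal D^\omega}\langle f,h^\sigma_I\rangle_\sigma\langle H(\sigma h^\sigma_I),h^\omega_J\rangle_\omega\langle\phi,h^\omega_J\rangle_\omega=A^1_1+A^1_2 ,$$

where $\mathcal A^1_1=\{(I,J)\in\mathcal D^\sigma\times\mathcal D^\omega:|J|\le|I|\}$, and we use the notation

$$A^i_j=\sum_{(I,J)\in\mathcal A^i_j}\langle f,h^\sigma_I\rangle_\sigma\langle H(\sigma h^\sigma_I),h^\omega_J\rangle_\omega\langle\phi,h^\omega_J\rangle_\omega .$$

The term $A^1_2$ is the complementary sum. The sums are estimated symmetrically. Thus it
suffices to prove (5.1) for the sum $A^1_1$. Indeed, the starred constants do not enter into this
estimate, but by duality will enter into those for $A^1_2$, in which the roles of $\omega$ and $\sigma$ are reversed.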

We shall follow the argument outlined in Chapters 18-22 of [Vol] by making several more
decompositions, generating a number of terms $A^i_j$. These will be bilinear forms, but we
will suppress the dependence of these forms on the functions $f$ and $\phi$. In this notation, the
superscript $i$ denotes the generation of the decomposition, and we will go to seven generations.
The subscript $j$ counts the number of decompositions in a generation. To aid the reader's
understanding of the argument, a flow chart of the decompositions is given in Figure 1. It
contains information about the proof, which we describe here.

• The chart is read from top to bottom, with the root of the chart containing the inner
product $\langle H(\sigma f),\phi\rangle_\omega$.

• Terms in diamonds are further decomposed, while terms in rectangles are final estimates.
The edges leading into rectangles are labeled by the hypotheses used to control
them, $\mathcal A_2$, $\mathcal W$, or $\mathcal F_{\gamma,\varepsilon}$ in the figure.

• There are three terms, A\, A® and the two from A\ for which the Energy Hypothesis 
with 7 > is essential. The edges leading into these terms are labeled to indicate 
this. 

• The horizontal dotted arrow from A\ to A\, labeled 'duality', indicates that A\ is 
controlled by the argument for A \, after exchanging the roles of / and 0. Accordingly, 
the final estimates in the dual tree will be in terms of the dual hypotheses, namely 
7"* e and W. 

• The edge leading into A\ is labeled 'Corona' to indicate that the Corona decomposition 
of §6 is used at this point. This is an important stage in the decomposition and one 
of the key ideas in [NTV4]. We modify it with the use of our Energy Hypothesis. 

• The edge leading into A\ is labeled 'Paraproducts' as all of the estimates in the fifth 
and subsequent generations use paraproduct arguments to control them. See §8. We 
organize the written proof to pass to the paraproducts first, with the other estimates 
taken up second in §9. 

We return to the main line of the proof of sufficiency. The collection of pairs of intervals
$\mathcal A^1_1$ is decomposed into the collections

(5.2) $$\mathcal A^2_1=\{(I,J)\in\mathcal A^1_1:2^{-r}|I|\le|J|\le|I|,\ \operatorname{dist}(I,J)\le|I|\} ,$$
$$\mathcal A^2_2=\{(I,J)\in\mathcal A^1_1:|J|\le|I|,\ \operatorname{dist}(I,J)>|I|\} ,$$
$$\mathcal A^2_3=\{(I,J)\in\mathcal A^1_1:|J|<2^{-r}|I|,\ \operatorname{dist}(I,J)\le|I|\} .$$

Using the notation of [Vol] and [NTV4], we refer to $\mathcal A^2_1$ as the 'diagonal short-range' terms;
$\mathcal A^2_2$ are the 'long-range' terms; and $\mathcal A^2_3$ are the 'short-range' terms.
We will show in §9.2 and §9.3, respectively,

(5.3) \A\\<u\\f\\ L2{(7) u\y {u)) , 



1^1 ~ ^2 \\j ii L 2 (ct ) nriiL 2 H 

These inequalities are obtained in Chapters 18 and 19 of [Vol]. 

The term $\mathcal A^2_3$ is the important one, and will be further decomposed into

(5.5) $$\mathcal A^3_1=\{(I,J)\in\mathcal D^\sigma\times\mathcal D^\omega:|J|<2^{-r}|I|,\ I\cap J=\emptyset,\ \operatorname{dist}(I,J)\le|I|\} ,$$

(5.6) $$\mathcal A^3_2=\{(I,J)\in\mathcal D^\sigma\times\mathcal D^\omega:|J|<2^{-r}|I|,\ I\cap J\ne\emptyset\} .$$

The 'mid-range' term $A^3_1$ will be handled by a variant of the method used on the 'long-range'
term, along with the $A_2$ condition. In particular, in §9.4 we prove

(5-7) \A\\<^ 



y,e \\J |lL2( ff ) W\\L?(w) ■ 

Thus, $\mathcal A^3_2$ is the true 'short-range' term. It is imperative to observe that for $(I,J)\in\mathcal A^3_2$, we
must have $J\subset I$, for otherwise we violate the fact that $J$ is good.



6. Energy, Stopping Intervals, Corona Decomposition 

Our focus is on the short-range term, as given by (5.6), and it is here that our Energy 
Condition (1.9) will arise in place of the Pivotal Condition in [NTV4]. This is a critical section 
in this proof, and it has three purposes. First, to derive the Energy Lemma, and combine it 
with the Energy Hypothesis. Second, to make the definition of the Corona. Third, use the 
Corona to obtain the next stage in the decomposition of the short-range term. 

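6.1. The Energy Lemma. As is typical in proofs that involve identification of a paraproduct,
one should add and subtract cancellative terms, in order that the paraproducts become
more apparent. Take a pair $(I,J)\in\mathcal A^3_2$. Thus, $J\cap I\ne\emptyset$ and $|J|<2^{-r}|I|$. But $J$ is good,
so that we have $J\subset I$, but not only that, we have

$$\operatorname{dist}(e(I),J)>\tfrac12|J|^\varepsilon|I|^{1-\varepsilon} .$$

Let $I_J$ be the child of $I$ that contains $J$, and let $\widetilde I$ denote some ancestor of $I_J$. (The specific
sequence of ancestors is to be selected below, in Definition 6.8.) We write

$$\langle H(\sigma\Delta^\sigma_If),\Delta^\omega_J\phi\rangle_\omega$$

(6.1) $$=\ \langle H(\mathbf 1_{I\setminus I_J}\sigma\Delta^\sigma_If),\Delta^\omega_J\phi\rangle_\omega$$

(6.2) $$+\ \mathbb E^\sigma_{I_J}\Delta^\sigma_If\cdot\langle H(\sigma\mathbf 1_{\widetilde I}),\Delta^\omega_J\phi\rangle_\omega$$

(6.3) $$-\ \mathbb E^\sigma_{I_J}\Delta^\sigma_If\cdot\langle H(\sigma\mathbf 1_{\widetilde I\setminus I_J}),\Delta^\omega_J\phi\rangle_\omega .$$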
As in [Vol] and [NTV4], we refer to the three terms in the last line as, respectively, the
'neighbor' term, the 'paraproduct' term (called 'middle' term in [Vol] and [NTV4]), and the 
'stopping' term. Note that $\Delta^\sigma_If$ takes a single value on $I_J$, which is exactly $\mathbb E^\sigma_{I_J}\Delta^\sigma_If$.
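The three-term splitting is a two-line consequence of this observation. As a sketch, writing $c$ for the constant value of $\Delta^\sigma_If$ on $I_J$ and $\widetilde I$ for the chosen ancestor of $I_J$ (both symbols are used here only for this illustration):

```latex
\begin{align*}
\Delta^\sigma_I f
  &= \mathbf 1_{I\setminus I_J}\,\Delta^\sigma_I f + c\,\mathbf 1_{I_J},
  \qquad c := \mathbb E^\sigma_{I_J}\Delta^\sigma_I f, \\
\mathbf 1_{I_J}
  &= \mathbf 1_{\widetilde I} - \mathbf 1_{\widetilde I\setminus I_J}
  \qquad (I_J\subset\widetilde I), \\
\langle H(\sigma\Delta^\sigma_I f),\Delta^\omega_J\phi\rangle_\omega
  &= \langle H(\mathbf 1_{I\setminus I_J}\sigma\Delta^\sigma_I f),\Delta^\omega_J\phi\rangle_\omega
   + c\,\langle H(\sigma\mathbf 1_{\widetilde I}),\Delta^\omega_J\phi\rangle_\omega
   - c\,\langle H(\sigma\mathbf 1_{\widetilde I\setminus I_J}),\Delta^\omega_J\phi\rangle_\omega .
\end{align*}
```

The first term on the right is the neighbor term, the second the paraproduct term, and the third the stopping term; only linearity of $H$ is used.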

Our analysis of the stopping term in (6.3) will bring forward the energy condition (1.9),
and yields a more general inequality which we formulate in this Lemma. Recall that

$$\Phi(I,E)=\omega(I)\,E(I,\omega)^2\,P(I,\mathbf 1_E\sigma)^2 .$$
Lemma 6.4 (Energy Lemma). Let $J\subset I'\subset I$ be three intervals with

(6.5) $$\operatorname{dist}(\partial I',J)\ge|J| .$$

(This follows from the good property for dyadic intervals, but we do not assume that any of
these three intervals are dyadic.) Let $\Phi_J$ be a function supported in $J$ and with $\omega$-integral
zero. Then we have

(6.6) $$\big|\langle H(\mathbf 1_{I\setminus I'}\sigma),\Phi_J\rangle_\omega\big|\le C\,\|\Phi_J\|_{L^2(\omega)}\,\Phi(J,I\setminus I')^{1/2} .$$

The $L^2$ formulation in (6.6) proves useful in many estimates below, in particular in the
proof of the Carleson measure estimate, Theorem 7.11. Indeed, we will apply (6.6) in the
dual formulation. Namely, we have

(6.7) $$\big\|H(\mathbf 1_{I\setminus I'}\sigma)-\mathbb E^\omega_JH(\mathbf 1_{I\setminus I'}\sigma)\big\|_{L^2(J;\omega)}\le C\,\Phi(J,I\setminus I')^{1/2} .$$

Note that on the left, we are subtracting off the mean value, and only testing the $L^2(\omega)$ norm
on $J$.

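Proof. For $x,x'\in J$, and $y\in I\setminus I'$, we have the equality

$$\frac1{x-y}-\frac1{x'-y}=\frac{x'-x}{|J|}\cdot\frac{|J|}{(x-y)(x'-y)} .$$

We use (6.5) to estimate the second term by

$$\frac{|J|}{(x-y)(x'-y)}\lesssim\frac{|J|}{|y-c_J|^2} ,$$

where $c_J$ is the center of $J$.

Turning to the inner product, the fact that $\Phi_J$ is supported on $J$ and has $\omega$-mean zero
permits us the usual cancellative estimate on the kernel. This familiar argument requires the
selection of an auxiliary point in $J$, and we use the measure $\omega$ to select it. We have

$$\big|\langle H(\mathbf 1_{I\setminus I'}\sigma),\Phi_J\rangle_\omega\big|=\left|\int_J\int_{I\setminus I'}\Phi_J(x)\left\{\frac1{x-y}-\mathbb E^{\omega(dx')}_J\frac1{x'-y}\right\}\sigma(dy)\,\omega(dx)\right|$$

$$=\left|\int_J\int_{I\setminus I'}\mathbb E^{\omega(dx')}_J\left[\frac{x'-x}{|J|}\cdot\frac{|J|}{(x-y)(x'-y)}\right]\Phi_J(x)\,\sigma(dy)\,\omega(dx)\right|$$

$$\lesssim\int_J\mathbb E^{\omega(dx')}_J\frac{|x-x'|}{|J|}\,|\Phi_J(x)|\,\omega(dx)\cdot P(J,\mathbf 1_{I\setminus I'}\sigma)$$

$$\le\|\Phi_J\|_{L^2(\omega)}\left[\int_J\Big(\mathbb E^{\omega(dx')}_J\frac{|x-x'|}{|J|}\Big)^2\omega(dx)\right]^{1/2}P(J,\mathbf 1_{I\setminus I'}\sigma) .$$

This completes the proof. □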



6.2. The Corona Decomposition. We now make two important definitions from [NTV4]: 
'stopping intervals' and the 'Corona Decomposition'. This is the main point of departure for 
our proof. But first we recall the notation introduced in (1.12), (1.15), 



u(I)E(I,u) 2 P(I, l E af 



%, E (I,E) = sup inf 



'=U>1 Js 



S>1 



J, 



X 



$>u,£) 



S>1 



where the supremum is over all $\varepsilon$-good subpartitions $\{J_s\}_{s\ge1}$ of $I$, and the Energy Hypothesis
(1.16)



r>l 



where $\gamma>0$, $\varepsilon>0$ are fixed. Recall that $\Phi$ appears in the Energy Condition (1.9) and
in the dual Energy Estimate (6.7), while the larger functional $\Psi_{\gamma,\varepsilon}$ appears in the Energy
Hypothesis (1.16). The key properties required of $\Psi$ are given in (1.14), and result in the
crucial off-diagonal decay of \I> relative to $ in Theorem 7.11 used to estimate term A^, as 
well as the estimates for the term Af and the stopping term A\ in Subsection 9.1. 

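Definition 6.8. Given any interval $I_0$, set $\mathcal S(I_0)$ to be the maximal $\mathcal D^\sigma$ strict subintervals
$S\subsetneq I_0$ such that

(6.9) $$\Psi_{\gamma,\varepsilon}(S,I_0)\ge4\mathcal F^2_{\gamma,\varepsilon}\,\sigma(S) .$$

The collection $\mathcal S(I_0)$ can be empty.

We now recursively define $\mathcal S_1=\{I^0\}$, and $\mathcal S_{j+1}=\bigcup_{S\in\mathcal S_j}\mathcal S(S)$. The collection $\mathcal S=\bigcup_{j=1}^\infty\mathcal S_j$
is the collection of stopping intervals. Define $\rho:\mathcal S\to\mathbb N$ by $\rho(S)=j$ for all $S\in\mathcal S_j$, so that
$\rho(S)$ denotes the 'generation' in which $S$ occurs in the construction of $\mathcal S$.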
Remark 6.10. It is worth emphasizing that we will not have a uniform inequality of the
following nature available to us: 

$$\Psi_{\gamma,\varepsilon}(S,I_0)\lesssim\sigma(S) .$$

In a similar, but different direction, one might be tempted to make the simpler definition of
a stopping interval that it is a maximal subinterval $S\subsetneq I_0$ for which one has

$$\Phi(S,I_0)=E(S,\omega)^2\,P(S,\mathbf 1_{I_0}\sigma)^2\,\omega(S)\ge4\mathcal E^2\sigma(S) .$$

This simpler condition does not permit one to fully exploit the Energy Hypothesis.

We now define the associated Corona Decomposition. 

Definition 6.11. For $S\in\mathcal S$, we set $\mathcal P(S)$ to be all the pairs of intervals $(I,J)$ such that

(1) $I\in\mathcal D^\sigma$, $J\in\mathcal D^\omega$, $J\subset I$, and $|J|<2^{-r}|I|$.

(2) $S$ is the $\mathcal S$-parent of $I_J$, the child of $I$ that contains $J$.

Note that $\mathcal A^3_2=\bigcup_{S\in\mathcal S}\mathcal P(S)$, where $\mathcal A^3_2$ is defined in (5.6). Let $\mathcal C^\sigma(S)$ be all those $I\in\mathcal D^\sigma$
such that $S$ is a minimal member of $\mathcal S$ that contains a $\mathcal D^\sigma$-child of $I$. (A fixed interval $I$ can
be in two collections $\mathcal C^\sigma(S)$.) The definition of $\mathcal C^\omega(S)$ is similar but not symmetric: all those
$J\in\mathcal D^\omega$ such that $S$ is the smallest member of $\mathcal S$ that contains $J$ and satisfies $2^r|J|<|S|$.
The collections $\{\mathcal C^\sigma(S):S\in\mathcal S\}$ and $\{\mathcal C^\omega(S):S\in\mathcal S\}$ are referred to as the Corona
Decompositions. Note that $\mathcal S\subset\mathcal D^\sigma$ and $\mathcal C^\sigma(S)\subset\mathcal D^\sigma$ for $S\in\mathcal S$ while $\mathcal C^\omega(S)\subset\mathcal D^\omega$ for $S\in\mathcal S$.
We denote the associated projections by

(6.12) $$P^\sigma_Sf=\sum_{I\in\mathcal C^\sigma(S)}\langle f,h^\sigma_I\rangle_\sigma\,h^\sigma_I ,$$

and similarly for $P^\omega_S\phi$. Note that $P^\omega_S$ projects only on intervals $J$ with $|J|<2^{-r}|S|$.
We have the estimate below that we will appeal to a few times.

(6.13) $$\sum_{S\in\mathcal S}\|P^\sigma_Sf\|^2_{L^2(\sigma)}\le2\|f\|^2_{L^2(\sigma)} .$$

There is a similar inequality for $P^\omega_S$ which we will also use.

Remark 6.14. In the definition of the stopping intervals, we are using the functional $\Psi_{\gamma,\varepsilon}$
associated with the Energy Hypothesis (1.16). Thus the stopping intervals can be viewed as
the enemy in verifying (1.16).

6.3. The Decomposition of the Short-Range Term. To conclude this section, our estimate
of $A^3_2$ defined in (5.6) combines the splitting (6.1), (6.2), (6.3) and the Corona Decomposition.
Namely, the Corona Decomposition selects the intervals $\widetilde I$ that appear in (6.1)-(6.3)
according to the following rule. Recall that

$$\mathcal A^3_2=\{(I,J)\in\mathcal D^\sigma\times\mathcal D^\omega:J\subset I\text{ and }|J|<2^{-r}|I|\} .$$

Here, we have used the fact that $J$ is good to make the condition defining $\mathcal A^3_2$ more explicit,
i.e. $J\subset I$ and $2^r|J|<|I|$.

Definition 6.15. Given a pair $(I,J)\in\mathcal A^3_2$, choose $\widetilde I\in\mathcal S$ to be the unique stopping interval
such that $I_J\in\mathcal C^\sigma(\widetilde I)$, where $I_J$ is the child of $I$ containing $J$. Equivalently, $\widetilde I\in\mathcal S$ is
determined by the requirement that $(I,J)\in\mathcal P(\widetilde I)$.

Note that if $I_J\notin\mathcal S$, then $\widetilde I\supset I$, while if $I_J\in\mathcal S$, then $\widetilde I$ is the child of $I$ containing $J$. Thus
$\widetilde I$ is a function of the pair $(I,J)$. With this choice of $\widetilde I$ in the splitting (6.1), (6.2), (6.3) we
obtain $|A^3_2|\le\sum_{j=1}^3|A^4_j|$ where

(6.16) $$A^4_1=\sum_{(I,J)\in\mathcal A^3_2}\langle H(\mathbf 1_{I\setminus I_J}\sigma\Delta^\sigma_If),\Delta^\omega_J\phi\rangle_\omega ,$$

(6.17) $$A^4_2=\sum_{S\in\mathcal S}\ \sum_{(I,J)\in\mathcal P(S)}\mathbb E^\sigma_{I_J}\Delta^\sigma_If\cdot\langle H(\mathbf 1_S\sigma),\Delta^\omega_J\phi\rangle_\omega ,$$

(6.18) $$A^4_3=\sum_{S\in\mathcal S}\ \sum_{(I,J)\in\mathcal P(S)}\mathbb E^\sigma_{I_J}\Delta^\sigma_If\cdot\langle H(\mathbf 1_{S\setminus I_J}\sigma),\Delta^\omega_J\phi\rangle_\omega .$$

Recall that the three terms above are the neighbor, paraproduct, and stopping terms respectively.

The paraproduct term $A^4_2$ is further decomposed, while we will prove in §9.5 and §9.1
respectively,

(6.19) $$|A^4_1|\lesssim\mathcal A_2\,\|f\|_{L^2(\sigma)}\|\phi\|_{L^2(\omega)} ,$$

(6.20) $$|A^4_3|\lesssim\mathcal F_{\gamma,\varepsilon}\,\|f\|_{L^2(\sigma)}\|\phi\|_{L^2(\omega)} .$$

7. The Carleson Measure Estimates 

This section is devoted to the statement and proof of several Carleson measure estimates, 
designed with the considerations of the next section in mind. We collect them here, due to 
the common sets of techniques used to prove them. 

The following technical Lemma encompasses many of the applications of the Energy Hypothesis
and the stopping time definition. For an interval $J\in\mathcal D^\omega$, let

(7.1) $$P^\omega_J\phi=\sum_{J'\in\mathcal D^\omega\,:\,J'\subset J}\langle\phi,h^\omega_{J'}\rangle_\omega\,h^\omega_{J'} .$$

Note that this projection $P^\omega_J$ is onto the span of all Haar functions $h^\omega_{J'}$ supported in the
$\mathcal D^\omega$-interval $J$. By contrast, $P^\omega_S$ projects onto the span of those Haar functions $h^\omega_J$ with $J$ in
the corona $\mathcal C^\omega(S)$ where $S$ is a stopping interval in the $\mathcal D^\sigma$ grid.

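Lemma 7.2. Fix an interval $I_0\in\mathcal D^\sigma$ and let $\widetilde I_0\in\mathcal S$ be its $\mathcal S$-parent. Let $\{I_r:r\ge1\}\subset\mathcal D^\sigma$
be a strict subpartition of $I_0$. For $r\ge1$, let $\{J_{r,s}:s\ge1\}\subset\mathcal D^\omega$ be a subpartition of $I_r$ with
$|J_{r,s}|<2^{-t}|I_r|$ for all $r,s\ge1$, where $t$ is at least the integer $r$ of Definition 4.2. We then have

(7.3) $$\sum_{r,s\ge1}\big\|P^\omega_{J_{r,s}}H(\mathbf 1_{\widetilde I_0\setminus I_r}\sigma)\big\|^2_{L^2(\omega)}\lesssim2^{-\gamma t}\mathcal F^2_{\gamma,\varepsilon}\,\sigma(I_0) .$$

Proof. We apply (6.7), and (1.16) to deduce the Lemma. We begin with (6.7) to obtain

$$\sum_{r,s\ge1}\big\|P^\omega_{J_{r,s}}H(\mathbf 1_{\widetilde I_0\setminus I_r}\sigma)\big\|^2_{L^2(\omega)}\lesssim\sum_{r,s\ge1}\Phi(J_{r,s},\widetilde I_0\setminus I_r)\lesssim\sum_{r,s\ge1}\Phi(J_{r,s},\widetilde I_0\setminus I_0)+\sum_{r,s\ge1}\Phi(J_{r,s},I_0\setminus I_r) ,$$

where the last inequality follows from the definition of $\Phi$ and the decomposition $\widetilde I_0\setminus I_r=(\widetilde I_0\setminus I_0)\cup(I_0\setminus I_r)$.

If $\widetilde I_0\ne I_0$, we estimate the sum involving $\widetilde I_0\setminus I_0$ using the fact that $\{J_{r,s}\}_{r,s\ge1}$ is an $\varepsilon$-good
subpartition of $I_0$ (because the intervals $J_{r,s}$ are good). We can thus use the third line in
(1.14), and then the fact that (6.9) fails when $I_0\ne\widetilde I_0$, to obtain

$$\sum_{r,s\ge1}\Phi(J_{r,s},\widetilde I_0\setminus I_0)\le\Big\{\sup_{r,s\ge1}\Big(\frac{|J_{r,s}|}{|I_0|}\Big)^\gamma\Big\}\Psi_{\gamma,\varepsilon}(I_0,\widetilde I_0)\lesssim2^{-\gamma t}\mathcal F^2_{\gamma,\varepsilon}\,\sigma(I_0) .$$

Next, to estimate the sum involving $I_0\setminus I_r$, we use the fact that $\{J_{r,s}\}_{s\ge1}$ is an $\varepsilon$-good
subpartition of $I_r$ for each $r$ (again since the intervals $J_{r,s}$ are good). We can thus use the
third line in (1.14), and finally the Energy Hypothesis (1.16) to obtain

$$\sum_{r,s\ge1}\Phi(J_{r,s},I_0\setminus I_r)\le\sum_{r\ge1}\Big\{\sup_{s\ge1}\Big(\frac{|J_{r,s}|}{|I_r|}\Big)^\gamma\Big\}\Psi_{\gamma,\varepsilon}(I_r,I_0)\lesssim2^{-\gamma t}\mathcal F^2_{\gamma,\varepsilon}\,\sigma(I_0) .$$

This last estimate also proves the case $I_0=\widetilde I_0\in\mathcal S$. □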
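Theorem 7.4. We have the following Carleson measure estimates for $S\in\mathcal S$ and $K\in\mathcal D^\sigma$:

(7.5) $$\sum_{S'\in\mathcal S(S)}\sigma(S')\le\tfrac14\sigma(S)\qquad\text{and}\qquad\sum_{S\in\mathcal S\,:\,S\subset K}\sigma(S)\lesssim\sigma(K) ,$$

(7.6) $$\sum_{J\in\mathcal C^\omega(S)\,:\,J\subset K,\ 2^r|J|<|K|}\big|\langle H(\mathbf 1_S\sigma),h^\omega_J\rangle_\omega\big|^2\lesssim(\mathcal F^2_{\gamma,\varepsilon}+\mathcal H^2)\,\sigma(K) .$$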
Remark 7.7. The Corona Decomposition and this last estimate can be compared to a general
strategy for proving Carleson measure estimates. The Corona Decomposition is reminiscent 
of the sets in (3.11); the condition (7.5) can be compared to (3.10); and the condition (7.6) 
can be compared to (3.11). 

Proof of (7.5). Concerning the second inequality in (7.5), as is well known, it suffices to verify
it for $K=S_0\in\mathcal S$. And this case follows from the recursive application of the first
half of (7.5) to the interval $S_0$ and all of its children in $\mathcal S$.

So we turn to the first half of (7.5). The intervals in the collection $\mathcal S(S_0)=\{S_r:r\ge1\}$
given in Definition 6.8 are pairwise disjoint and strictly contained in $S_0$. Each of them satisfies
(6.9), so we can apply (1.16) to see that

(7.8) $$\sum_{S\in\mathcal S(S_0)}\sigma(S)=\sum_{r\ge1}\sigma(S_r)\le\frac1{4\mathcal F^2_{\gamma,\varepsilon}}\sum_{r\ge1}\Psi_{\gamma,\varepsilon}(S_r,S_0)\le\tfrac14\sigma(S_0) .$$

□
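The recursion behind the second half of (7.5) is a geometric series. As a sketch, assuming the first half of (7.5) holds with the constant $\frac14$ suggested by (7.8), and writing $\mathcal S^{(k)}(S_0)$ (a notation introduced only for this illustration) for the $k$-th generation of stopping intervals below $S_0$:

```latex
\begin{align*}
\mathcal S^{(0)}(S_0) &= \{S_0\}, \qquad
\mathcal S^{(k+1)}(S_0) = \bigcup_{S'\in\mathcal S^{(k)}(S_0)}\mathcal S(S'), \\
\sum_{S\in\mathcal S^{(k+1)}(S_0)}\sigma(S)
  &= \sum_{S'\in\mathcal S^{(k)}(S_0)}\ \sum_{S\in\mathcal S(S')}\sigma(S)
  \le \frac14\sum_{S'\in\mathcal S^{(k)}(S_0)}\sigma(S')
  \le 4^{-(k+1)}\sigma(S_0), \\
\sum_{S\in\mathcal S\,:\,S\subset S_0}\sigma(S)
  &= \sum_{k=0}^{\infty}\ \sum_{S\in\mathcal S^{(k)}(S_0)}\sigma(S)
  \le \sum_{k=0}^{\infty}4^{-k}\,\sigma(S_0) = \frac43\,\sigma(S_0).
\end{align*}
```

The middle line is proved by induction on $k$; the last line uses that every stopping interval contained in $S_0$ belongs to exactly one generation.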

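Proof of (7.6). Fix $S\in\mathcal S$ and $K$, which we can assume is a subset of $S$. If we apply the
Hilbert transform to $\sigma\mathbf 1_K$, as opposed to $\sigma\mathbf 1_S$, we have by (3.4) for $\omega$ and (1.3),

$$\sum_{J\in\mathcal C^\omega(S)\,:\,J\subset K,\ |J|<2^{-r}|K|}\big|\langle H(\mathbf 1_K\sigma),h^\omega_J\rangle_\omega\big|^2\le\int_K|H(\mathbf 1_K\sigma)|^2\,\omega(dx)\le\mathcal H^2\sigma(K) .$$

And so we consider the Hilbert transform applied to $\sigma\mathbf 1_{S\setminus K}$, and show

(7.9) $$\sum_{J\in\mathcal C^\omega(S)\,:\,J\subset K,\ |J|<2^{-r}|K|}\big|\langle H(\mathbf 1_{S\setminus K}\sigma),h^\omega_J\rangle_\omega\big|^2\lesssim\mathcal F^2_{\gamma,\varepsilon}\,\sigma(K) .$$

We can assume that $K\subsetneq S$, and that there is some $J\in\mathcal C^\omega(S)$ with $J\subset K$. From this we
see that $K$ was not a stopping interval. That is, the interval $K$ must fail (6.9).

Let $\mathcal J$ denote the maximal intervals $J\in\mathcal C^\omega(S)$ with $J\subset K$ and $|J|<2^{-r}|K|$. Using the
notation of (7.1), we can use (7.3), with $I'=K$, $I=S$, and $J\in\mathcal J$. It gives us

$$\sum_{J\in\mathcal J}\big\|P^\omega_JH(\mathbf 1_{S\setminus K}\sigma)\big\|^2_{L^2(\omega)}\lesssim\sum_{J\in\mathcal J}\Phi(J,S\setminus K)\lesssim\mathcal F^2_{\gamma,\varepsilon}\,\sigma(K) .$$

The second inequality uses the fact that $K$ fails (6.9). This proves (7.9). □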
The following Carleson measure estimate, along with §9.1 and §8.5, are the three places
where the Energy Hypothesis is used in this proof: It will provide the decay in the
parameter $t$ in (7.12). For all integers $t\ge0$, we define for $S\in\mathcal S$, which are not maximal,

(7.10) $$\alpha_t(S)=\sum_{S'\,:\,\pi^t_{\mathcal S}(S')=S}\big\|P^\omega_{S'}H(\sigma\mathbf 1_{\pi_{\mathcal S}(S)\setminus S})\big\|^2_{L^2(\omega)} .$$

Here, we are taking the projection of $H(\sigma\mathbf 1_{\pi_{\mathcal S}(S)\setminus S})$ associated to parts of the Corona decomposition
which are 'far below' $S$. We have this off-diagonal estimate.

Theorem 7.11. The following Carleson measure estimate holds:

(7.12) $$\sum_{S\in\mathcal S\,:\,\pi_{\mathcal D^\sigma}(S)\subset K}\alpha_t(S)\lesssim2^{-\gamma t}\mathcal F^2_{\gamma,\varepsilon}\,\sigma(K),\qquad K\in\mathcal D^\sigma .$$

The implied constant is independent of the choice of interval $K$ and $t\ge1$.

Remark 7.13. In the estimate (7.12), we draw attention to the fact that the dyadic parent
$\pi_{\mathcal D^\sigma}(S)$ of $S$ appears. Similar conditions will arise below, and it is essential to track them
as the measures we are dealing with are not doubling. In fact, the role of the dyadic parents
is revealed in the next proof: Use the negation of (6.9) when $\pi_{\mathcal D^\sigma}(S)\notin\mathcal S$, and otherwise use
the Energy Condition.

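Proof. Our first task is to show that

$$\sum_{S\in\mathcal S(\widetilde S)}\alpha_t(S)\lesssim2^{-\gamma t}\mathcal F^2_{\gamma,\varepsilon}\,\sigma(\widetilde S),\qquad\widetilde S\in\mathcal S .$$

For the purposes of this proof, we will set $\mathcal S_t(S)=\{S'\in\mathcal S:\pi^t_{\mathcal S}(S')=S\}$, using this
notation for $S\in\mathcal S(\widetilde S)$. We want to apply (6.7) to the expressions $\alpha_t$. To this end define

(7.14) $$\mathcal J(S')=\{J\in\mathcal C^\omega(S'):J\text{ is maximal w.r.t. }J\subset S',\ |J|<2^{-r}|S'|\} .$$

It follows by definition that we have $|J|<2^{-r}|S'|$ for all $J\in\mathcal J(S')$. And, as all Haar
functions have mean zero, we can apply (6.7). From this, we see that

$$\alpha_t(S)\lesssim\sum_{S'\in\mathcal S_t(S)}\ \sum_{J\in\mathcal J(S')}\Phi(J,\widetilde S\setminus S) ,$$

and so by the third line in (1.14),

(7.15) $$\sum_{S\in\mathcal S(\widetilde S)}\alpha_t(S)\lesssim\sum_{S\in\mathcal S(\widetilde S)}\ \sum_{S'\in\mathcal S_t(S)}\ \sum_{J\in\mathcal J(S')}\Phi(J,\widetilde S\setminus S)\le2^{-\gamma t}\sum_{S\in\mathcal S(\widetilde S)}\Psi_{\gamma,\varepsilon}(S,\widetilde S)\le2^{-\gamma t}\mathcal F^2_{\gamma,\varepsilon}\,\sigma(\widetilde S) ,$$

where the final inequality follows from the assumed Energy Hypothesis (1.16).

Now fix $K$ as in (7.12) and let $\widetilde S\in\mathcal S$ be the stopping interval such that $K\in\mathcal C^\sigma(\widetilde S)$. Let
$\mathcal G_1=\{S^1_i\}_i$ be the maximal intervals from $\mathcal S$ that are strictly contained in $K$. Inductively
define the $(k+1)$st generation $\mathcal G_{k+1}$ to consist of the maximal intervals from $\mathcal S$ that are strictly
contained in some $k$th generation interval $S\in\mathcal G_k$. Inequality (7.15) shows that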

«,(£)< 2-^ 7 2 £ 5>(S). 

We also have from (7.5) that 

oo 

fc=l Seg k SeGi 

This will be all we need in the case K = S, but when K ^ S, we will use Lemma 7.2 to 
control the first generation intervals S in Q\. 

£ oct(S) < 2-*V (JO • 

Indeed, we simply apply Lemma 7.2 with J = S, Jo = K, {I r } r> i = Gi, and {J r , s } >i = 

Us'es t (5) )■ 

When K S we finish with 

oo 

^ ^(5) = E a '( 5 ) + E £ a t (S) 
ses :7t 1 T)IT (s)ck seGi k=i seg k+1 

oo 

< 2-*V (JO + 2-*" J- 7 2 B £ £ a (S) 

k=i seg k 

and when K = S we set C/o — {$} an d estimate 

oo oo 

E = E £ s 2 " <7 ^. E E * (*) 

5g5 : 7r^ a {S)cs k=o seg k+1 k=o seg k 

□ 

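We need a Carleson measure estimate that is a common variant of (7.5) and (7.12). Define

(7.16) $$\beta(S)=\big\|P^\omega_SH(\sigma\mathbf 1_{\pi_{\mathcal D^\sigma}(S)})\big\|^2_{L^2(\omega)} .$$

Theorem 7.17. We have the Carleson measure estimate

(7.18) $$\sum_{S\in\mathcal S\,:\,\pi_{\mathcal D^\sigma}(S)\subset K}\beta(S)\lesssim(\mathcal H^2+\mathcal F^2_{\gamma,\varepsilon})\,\sigma(K) .$$

Proof. Using the decomposition $\pi_{\mathcal D^\sigma}(S)=S\cup\{\pi_{\mathcal D^\sigma}(S)\setminus S\}$, we write $\beta(S)\le2(\beta_1(S)+\beta_2(S))$
where

$$\beta_1(S)=\big\|P^\omega_SH(\sigma\mathbf 1_S)\big\|^2_{L^2(\omega)} ,$$

$$\beta_2(S)=\big\|P^\omega_SH(\sigma\mathbf 1_{\pi_{\mathcal D^\sigma}(S)\setminus S})\big\|^2_{L^2(\omega)} .$$

We certainly have $\beta_1(S)\le\mathcal H^2\sigma(S)$, so that by (7.5), we need only consider the Carleson
measure norm of the terms $\beta_2(S)$.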

Fix an interval K of the form K = tt^(Sq) for some So G S. Let T be the maximal intervals 
of the form ttj,(S) C K, and for TeT, let S(T) be all intervals 5 G S with 5 C T and 5 is 
maximal. Using the notation of (7.14) and (7.1), we can estimate 

E E £ E E E 1 w^(5,\5) 

TeT seS(T) TeT seS(T) Jej(S) 

< ^AK) 

Here, we have have been careful to arrange the collections T, S(T) and J(S) so that (7.3) 
applies. 

We argue that this inequality is enough to conclude the Lemma. Suppose that S' G 5, 
with 5" C K, but S' is not in any collection S(T) for TeT. It follows that S' C 5 for some 
5 G S(T) and TeT. This implies that the Carleson measure estimate (7.5) will conclude 
the proof. □ 

A last Carleson measure estimate needed arises from the quantities 

2 



(7.19) 7 (5) 



Theorem 7.20. We have the estimate 



(7-21) E 7(5) < • 

Proof. We can take .K" = vr^, CT (5 ) for some 5 G 5, and in addition, we can assume that 
K ^ S, because otherwise we are applying the Hilbert transform to the zero function. 
We repeat an argument from the previous proof. Details are omitted. □ 
8. The Paraproducts

We continue to follow the line of argument in [Vol] and [NTV4] using similar notation for
the benefit of the reader. The paraproduct term $A^4_2$ is the central term in the proof. In
this section, we reorganize the sum in (6.17) according to the Corona Decomposition: The
essential point that must be accounted for is that for $J\in\mathcal C^\omega(S)$ and $J\subset I$, we need not
have $I\in\mathcal C^\sigma(S)$. On the other hand, it will be the case that $I\in\mathcal C^\sigma(\pi^t_{\mathcal S}(S))$ for some ancestor
$\pi^t_{\mathcal S}(S)$ of $S$. The ancestor $\pi^t_{\mathcal S}(S)$ is only defined for $1\le t<\rho(S)$. (See Definition 6.8 for the
definition of $\rho(S)$.) In fact, the sum splits into $A^4_2=A^5_1+A^5_2$, where

(8.1) $$A^5_1=\sum_{S\in\mathcal S}\ \sum_{\substack{(I,J)\in\mathcal P(S)\\ J\in\mathcal C^\omega(S)}}\mathbb E^\sigma_{I_J}\Delta^\sigma_If\cdot\langle H(\mathbf 1_S\sigma),\Delta^\omega_J\phi\rangle_\omega ,$$

(8.2) $$A^5_2=\sum_{S\in\mathcal S\setminus\{I^0\}}\ \sum_{t=1}^{\rho(S)}\ \sum_{\substack{(I,J)\in\mathcal P(\pi^t_{\mathcal S}(S))\\ J\in\mathcal C^\omega(S)}}\mathbb E^\sigma_{I_J}\Delta^\sigma_If\cdot\langle H(\mathbf 1_{\pi^t_{\mathcal S}(S)}\sigma),\Delta^\omega_J\phi\rangle_\omega .$$

In $A^5_1$, we are treating the case where both $I\in\mathcal D^\sigma$ and $J$ are 'controlled' by the same
stopping interval. ($J$ is not 'very far' below $I$, as measured by the stopping intervals $\mathcal S$.)
And, the point in the last line is that we are summing over $J\in\mathcal C^\omega(S)$, while the pair
$(I,J)\in\mathcal P(\pi^t_{\mathcal S}(S))$, where $\pi^t_{\mathcal S}(S)$ denotes the $t$-fold parent of $S$ in the grid $\mathcal S$, see (3.1). This
ancestor appears in two places, controlling the sum over $I$, and in the argument of the Hilbert
transform.

We will prove

(8.3) $$|A^5_1|\lesssim(\mathcal H+\mathcal F_{\gamma,\varepsilon})\,\|f\|_{L^2(\sigma)}\|\phi\|_{L^2(\omega)} ,$$

while $A^5_2$ will require further decomposition.

8.1. A\\ The First Paraproduct. We use the telescoping sum identities (3.5) and (3.6) to 
reorganize the sum in (8.1). Fix S G S and J G C"(S). The sum over / in (8.1) is (6.12). 

(8-4) E E lA r7 if = E l J J-Kv( S) f- 

I ■. (i,J)er(S) 

Here, we set to be the minimal member of C a (S) that contains J, and satisfies 2 r \J\ < |/|. 
Such an interval must exist as J is good. Thus, we can write 

(8.5) A5 = 5>{(S), 

ses 

(8-6) Al(S)= E (^IJ-^-imsv),^)*. 

The basic estimate here, and our first paraproduct style estimate is 
Proposition 8.7. We have the estimates

(8.8) \Al(S)\ <(^ + ^ 7 , e )||P5/-l5E^ (s) /|| L2(ff) ||P^|| i>) , SeS. 
Here, the projections on the right are defined in (6.12). 

Proof. We should reorganize the sum in a fashion consistent with paraproduct-type estimates. 
For / G C CT (S), let 

Qf<f> = e 

jec^{sy.ij^=i 

Using the Cauchy-Schwartz inequality, and the fact that EJf = EJPg/ and = QjP^ 
we see that 



\Al(S)\ 



iec°{S) 



< 



E i E ? p 5/-^ (5) /MiQ^(i5^)iii 2H E iiqtoi&m 

/eC CT (5) /eC^(5) 

1 1/2 

< IIPg^M E ( p 5/ - ^C^)/)! 2 ■ IIQ/#(i^)ll 



I 



In view of the Carleson Embedding inequality, namely (3.8) and (3.9), this last factor is at 



most 



P%f - lsK v (S)f 



times the Carleson measure norm of the coefficients 



But by the Plancherel formula (3.4) this is what is shown in (7.6) to be at most a constant 
multiple of H + J- lt£ , so the proof is complete. □ 

To complete the estimate for A\, from (8.5) and the observation that the projections on 
the right in (8.8) are essentially orthogonal, see (6.13), we can estimate 

\a\\ <(u + T n , E ) E ||ps/ - M^/H^j IIp^IIl^h • 

ses 

<(u + r 7 , E ) (E \\ p sf - isKv {s) f\\ 2 LHa) E ll p 

\ses ses 



1/2 



< 
since we have

ElM^Cw = E^I^Wl 2 

ses ses 

< ^(S)(E%\f\) 2 

ses 

< \\MJ\\l 2{a) <\\f\\ 2 L2{(T y 

Here, we should make an appeal to (7.5) in order to conclude that the maximal function $M_\sigma$
dominates the sum in (8.10). This is (8.3).

8.2. The Remaining Paraproducts. We repeat the analysis of (8.4), but for the term A\ 
defined in (8.2). Fix J, which must be a member of C"{S) for some S G S \ {/°}. The sum 
over / in (8.2), as it turns out, is only a function of this S, and equals 

p(S) 

(8.12) A 5 2 (S)=J2 E E^AJ/.(#(1 4(S)( 7),A^ 

t=i (i,j)ev(^ s (s)) 
JeC"(5) 

P(S) 



t=i JeC u (S) 



We argue as follows. With J G C W (S) fixed, the sum over I such that (I, J) G "P^^S")) 
is only a function of 5 and t, and is a sum over consecutive intervals in the grid V>° . The 
smallest interval that contributes to the sum is 7r|, CT (vt^ -1 (S)) , the second dyadic parent of 
tt^ (S), and the largest is lOjy, {ftsiS))- (Recall Definition 6.11. Also, these two intervals 
might be one and the same.) 

In (8.12), the sum over J is independent of the sum over t. In the next steps, we concentrate 
on the sum over t. Below we add and subtract a cancellative term, to adjust for the second 
parent in (8.12). 

P(S) 

(8.13) AUS) = BH^B))/ " K h , { „<-«s>)f + ^.(''fW - E W)>-« 

t=l 

x H(l n t s(s) a) 
= A 5 2l (S) + A\ 2 {S) , 

P(S) 

(8.14) A^S) S £ W - ' 



(8.15) ^(5) = E K ct(4 - 1(5)) / - ^ ctH(5)) /) • H(l^ s{s) a) . 
The term Al 2 (S) in (8.15) is itself a telescoping sum, and so we can sum by parts to write

P(S) 

(8.i6) aUs) = K lAS) f ■ h(k U s)°) + E E : iCT (W ■ H ^ns)vm^ ■ 

Note that there is one term missing, but it has the expectation $\mathbb{E}^\sigma_{\pi^{\rho(S)}_{\mathcal{S}}(S)} f = \mathbb{E}^\sigma_{I^0} f$, where
$I^0$ is the largest interval that we fixed at the beginning of the proof. In particular we have
assumed that this expectation is zero.

We combine these steps, specifically the definition of A\ in (8.2) and the identities (8.13), 
(8.14), (8.15), and (8.16) to write A\ = A\ + A% + Ajj, where 

A\ = »* = 1.2,3, 

Se<S\{/°} 

p(s) 

(8-18) At(S) = E^ (5) / • (^(l^^), , 

P(S) 

Of these three expressions, the first A\ has cancellative terms on both $f$ and $\phi$, hence it is
not (yet) a paraproduct as such. The second A\ is a paraproduct, one that is very close in 
form to that of Af, compare (8.6) and (8.18). The third term is a paraproduct, but looking at 
the support of the argument of the Hilbert transform, one sees that it is also degenerate, and 
we should obtain some additional decay in the parameter t, the 'miraculous improvement of 
the Carleson property' in Chapter 21.3 of [Vol] - see (8.21) below. We take up these estimates 
in the next subsections, passing from more intricate to less intricate. 

In fact we will prove in §8.3, §8.4 and §8.5 respectively, 

(8-19) \Al\<^,e\\f\\ LH(T) H\\ LHuj) , 

(8.20) \At\<(n + ^ £ )\\f\\ L2{a) u\\ L2{uj) , 

\A\\<{u+^ h£ ) imi L>) . 

In particular, the Energy Hypothesis enters into (8.19). 
8.3. The Term A® . Let us fix t, and define 

ses ■. p(S)>t 

Here, we impose the restriction p(S) > t so that the t-fold parent of S is defined. 
The estimate we prove is

(8.21) \Al(t)\ < T+T^ ||/|| L2(ff) ||0|| L2M , f > 1 . 

The constant e = 7/2 > 0. Clearly this proves (8.19) after summation on t > 1. 
The projections P£ are orthogonal, so we have 






.22) \A*(t)\ < U\\ L2{uj) 



2 



1/2 



E r^(4(s))-^ 

SG5 : p(5)>t 

Recalling the notation (7.10), the sum on the right in (8.22) is 

m 1/2 

.SeS 

Therefore, to prove (8.21), we should verify that the Carleson measure norm of the coefficients 
{att(S) : S G S} is at most C2 _7 *J r 7)£ . But this is the content of Theorem 7.11, and so our 
proof is complete. 

8.4. The term A\. We certainly have is\) a {S) C 7Tg(S), so that it is natural to split term in 
(8.18) into two, namely writing tt^(S) = 7r^, a {S) U {^(^Vr.p^S')}, to give us 



5.23) \A[\ 



(8.24) \A\\ 



ses 



<(H + ^, e ) \\f\\ L2M 



{a) \m\l?(u) ' 



ses 



7,e \\J \\L*(<r) W\\L 2 (oj) ' 



Together these prove (8.20). We treat them in turn. 
Recalling the notation (7.16), we estimate 



P^(l 4(5) a),P^ 



</3(5) 1/2 ||P^|| i2M • 

The latter projections are mutually orthogonal so we can estimate 



1 1/2 



.ses 



L3(w) 



7,^ II -/ 1 1 1,2 (a) \m\L*{u) ■ 

We have appealed to the Carleson measure estimate (7.18) to get the $\|f\|_{L^2(\sigma)}$ term. This
proves (8.23).
The argument for (8.24) is similar. Recalling the notation (7.19), we have



<7(S) V2 ||P' 



We estimate 



141 < 



.565 



1/2 



~ ? 7,e IM llL2(cr) IIVIlL2 (w ) 



We have appealed to the Carleson measure estimate (7.21) to get the $\|f\|_{L^2(\sigma)}$ term.



8.5. The Term A\. In the definition of A\{S), see (8.17), note that the difference of expec- 
tations depends upon a single Haar coefficient, the one for the dyadic interval ^x><t(^s(S)). 
To be explicit, we will have the following equality 



EZ 2 r ( S )f -Kl„(s)f 



SeS 



We reindex the sum defining A\ as follows. From (8.17), we write 



P(S) 



-AlW = E E ^(4- 1 W) A i(4- 1 W) / ■ ( H ^Us)^ W = 4(S) + Al(S) 



t=i 



P(S) 



(8.25) A 7 3 (S) EVr, . t -i ( ^A% . t _ lft -J ■ (H(l 



4(s)\4~ 1 (s)° r )' p ^ 



t=l 



.26) 



P(S) 



Al(S)=Y,Ki 
We argue that 



ses-{s } 



7,e IU IIL2 ((T ) imiL2(w) ' 



(8.27) 



£ ^) 



5e5-{5 } 



7,s IM llL2( CT ) \m\l?(w) 



Indeed, the first inequality (8.26) is easier than the argument for (8.19), due to the extra 
orthogonality present with the Haar difference applied to $f$ in (8.25). We omit the proof.
We turn to the proof of (8.27), and will need to appeal to our Energy Hypothesis again.
Begin by reindexing the sum. We define 



(8.28) 



s'es 
n t - 1 (S')=s 



<2-^ 2 J- 7 , £ ||/|| L2((T) ||0|| L2H , t>l. 



(The decay in $t$ is slightly worse in this case than in others.) Indeed, we first exploit the
implicit orthogonality in the sum. Note that we will have



El</,/*v (5 ,u 2 < 



2 



E 

ses 



E ^< 

S'eS:n t - 1 (S')=S 



< 



We also have from (3.2), that 

^ CT( 5)^ ( 5)|<i^(^r 1/2 

Combining these facts, we see that (8.28) follows from the estimate 



5.29) 



E 

s , eS:TT t - 1 (S')=s 



< 2-^(H 2 + 7?J|7r^(5)| , S e S,t > 1 . 



We turn to the proof of this last estimate. We will need geometric decay from two different 
sources. One is the geometric decay in (7.5), and the second is the application of the Energy 
Hypothesis, as in the proof of Theorem 7.11. Fix S G S, and integer u ~ and let S u be 
those S' e S with ir%(S') = S. We have 



£ P 5^(l4-(5)^) 

5'e5:7r t - 1 (5')=5 



E 



S"eS u 



B(S') 



E 

S"e5:7r t - 1 -«(S")=5' 



PS„F(1 4 - 1(5) <7) 
Now, in the definition of $B(S')$, we adjust the argument of the Hilbert transform, writing
$B(S') = B_1(S') + B_2(S')$, where

2 



Bx(S') = 



B 2 (S>) 



E 

S"eS:TT t - 1 -^(S")=s / 



P S"- H '( 1 4- 1 (S)\S' or ) 



P&'Hils,*) 

S"eS:TT t - 1 -"(s")=s / 



Now, by the testing condition (1.3), we have 

where we have appealed to the Carleson measure property of the measure $\sigma$ on the stopping
cubes, more precisely (7.8), to deduce the last line. This proves half of (8.29).
We use the notation (7.14), and apply (7.3) to see that 

X>(^=E E E iwrn^s^m^) 

S'&Su S'£Su 5"e5:7r t - 1 -«(S")=5' J£J{S") 

This completes the proof of (8.29). 

9. The Remaining Estimates 

We collect together the estimates claimed in earlier sections. The estimates in the first 
two subsections below are in [Vol], and the remaining three subsections essentially follow the 
arguments in [Vol] but using the Energy Hypothesis in §9.1. 

9.1. A|: The Stopping Terms. To control (6.18), and prove (6.20), it is important that 
we are dealing with the Energy Hypothesis (1.16). 
We first claim that for S G S and s > an integer 

(9.1) Al(S,s) = 



(I,J)eV(S):\J\=2- s \I\ 

<2-^^ £ F(S)A(S,s 
iec°{S) 



ElA°f-(H(l SVj a),A»<j ) ) o 



E KMS) t 

iec°(S) J ■. (i,J)er{sy.\J\=2' s \i\ 
Indeed, apply Cauchy-Schwarz in the $I$ variable above, and appeal to (3.2), to see
that



A\(S,s)<F(S) 



E 



E 



1 



l€C°(S) \J : (I,J)eV(S):\J\=2- s \I\ 

We can then estimate the sum inside the braces by 



E E \^ h "U 2 

IeC(S) J : (I,J)£P(S):\J\=2-s\I\ 

' [ [H(l SVj a),hj 



E 

J : (I,J)eV(S):\J\=2- s \I\ 

<A(S,s) 2 -A(S,s) 

E a(Ij)- 1 -\(H(l SVj a),h-<f ) ) u 

J : {I,J)£V(S):\J\=2- S \I\ 



A(S, s) = sup 

/GC CT (5) 



We turn to the analysis of the supremum in the last display. We denote the two children of $I$
by $I_\theta$ for $\theta \in \{-,+\}$. Using (6.6) and then the third inequality in (1.14), we have



A{S,s) < sup sup a(I e ) 1 

iec°(s) o &{-,+} 



E *(J,S\I e ) 
j ■■ (i,j)£V(sy.ij=i e \j\=2- ! >\i\ 

< sup sup a{I e )- l 2-^ 1)e {h,S) 
IeC(S) 6»e{-,+} 



7>£ 



The third inequality is the one that the definition of stopping intervals was designed to
deliver: From Definition 6.11, as $(I,J) \in \mathcal{P}(S)$, we have that $S$ is the $\mathcal{S}$-parent of $I_J$, hence
$I_J$ was not a stopping interval, that is, (6.9) does not hold, delivering the estimate above.

We clearly have from (3.4) that 

E^) 2 <Ek/'^i 2 = 



2 



ses 



And so we have from (9.1), 



E E A t( s > s ) z ^ imiz» E E 2 ~ 7SA ( 5 > s ) 



1/2 



< T 



7,e \\J llL2(a) \m\L?{u) 
9.2. A\: Diagonal Short Range Terms. To prove (5.3), let us recall the definition (5.2).
The pairs of intervals $I, J$ arise from the dyadic grids $\mathcal{D}^\sigma$ and $\mathcal{D}^\omega$ respectively. But these grids
share a common set of endpoints of the intervals. And the intervals $I, J$ have comparable
lengths, $2^{-r}|I| \le |J| \le |I|$. Accordingly, these pairs of intervals satisfy the conditions of the
weak boundedness condition (2.7). A Haar function $h^\sigma_I$ is a linear combination of the indicators of its children,
and the children of $I$ and $J$ also satisfy the weak boundedness condition (2.7). From this,
we see that

\(H(ahl),h"j)J<4W, (I,J)eAl 
The Schur test easily implies that 



J : {I,J)EA\ 



< w||/|| i2(CT) u\\ L2{w) 



9.3. A\: The Long Range Term. We prove the estimate (5.4). Recall that the pairs of
intervals $I, J$ in question satisfy $|J| < |I|$ and $\mathrm{dist}(I,J) > |I|$. The hypotheses of (6.6) are in
force, in particular (6.5) holds.

We observe that the Energy Lemma can be applied to estimate the inner product $\langle H(h^\sigma_I \sigma), h^\omega_J\rangle_\omega$.
To see this, note that $h^\sigma_I$ is constant on each child $I_\pm$. So, take a child $I_\theta$, and apply the
Energy Lemma with the largest interval $I$ taken to be



hull 



IeA V\ 



J 



Here A J means the interval with the same center as J and length equal to A|J|. The two 



intervals Ig and (^yjjj J are disjoint. We take I' C / so that I\I' = Iq. Then, the Energy 
Lemma (6.6) and (3.2) apply to give us the estimate below. 



l-£ 



(9.2) 



(3(1, J) 



< 



e 

uj(jy 



<\w Ie h°\Y,\{mie°),hf) L 



E 



E(J,u)P(J,l Tv ,a) 



/2 



\J\ 



dist(J, J) 2 ' 



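We have used the trivial inequalities $E(J,\omega) \le 1$ and

$$P(J, 1_{I_\theta}\sigma) \lesssim \frac{|J|}{\mathrm{dist}(I,J)^2}\,\sigma(I_\theta).$$

We may assume that $\|f\|_{L^2(\sigma)} = \|\phi\|_{L^2(\omega)} = 1$. We then estimate

$$\sum_{I}\ \sum_{J \,:\, |J|<|I|,\ \mathrm{dist}(I,J)>|I|} |\langle f, h^\sigma_I\rangle_\sigma|\,\beta(I,J)\,|\langle \phi, h^\omega_J\rangle_\omega|$$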
J J : |J|<|/|:dist(7,J)>|7| 

-5 



dist(7, J) 2 

/ J : |J|<|/|:dist(/,J)>|/| V 1 1 7 V ' 7 

+e™ 2 e 

J I : |J|<|/|:dist(/,J)>|7| V 1 1 7 V ' > 

where we have inserted the gain and loss factors $(|J|/|I|)^{\pm\delta}$ with $0 < \delta < 1$ to facilitate
application of Schur's test. For each fixed $I$ we have



J : |J|<|/|:dist(7,J)>|/| 



oo 



s^E^I E ^hy^ 

k=0 \ J : 2 fe |J| = |7|:dist(7,J)>|/| V ' ' 



X 



E 



\j\ 



which is bounded by 



fc=0 

if $\delta > 0$. For each fixed $J$ we have



dist(/, J) 2 

J : 2 fe |J| = |7|:dist(/,J)>|/| V ' ' 



t,— n VII / 



/ : |J|<|/|:dist(/,J)>|7| 



E. . (^"^^^ 



fc=0 / : 2 fc |J| = |/|:dist(/,J)>|/| V ' 1 



oo 



fc=0 \/ : 2 fe |J| = |/|:dist(7,J)>|/| 
which is bounded by



/ i \ 

w (J)i^2--('-«P(2'j,.)i^j 



fc=0 

if $\delta < 1$. With any fixed $0 < \delta < 1$ we obtain from the inequalities above that

1^1 < Ek/'^i 2 ^ + Ei^^)J 2 ^ 

/ J 

Ua) + U\\W))^-2A 2 \\ff L2{a) Uf LHu]) , 

since we assumed $\|f\|_{L^2(\sigma)} = \|\phi\|_{L^2(\omega)} = 1$.
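
The Schur-test step used in this subsection can be sanity-checked numerically. The following is only a minimal sketch under stated assumptions: the kernel `toy_kernel` (with decay $2^{-\delta|i-j|}$), the parameter values, and all function names are hypothetical and are not the coefficients $\beta(I,J)$ of the text. It checks the simplest form of Schur's test, with constant weights: for a nonnegative kernel, the bilinear form is bounded by the geometric mean of the largest row sum and the largest column sum.

```python
import random

def toy_kernel(n, delta):
    """Hypothetical nonnegative kernel K[i][j] = 2**(-delta*|i-j|); not the paper's beta(I, J)."""
    return [[2.0 ** (-delta * abs(i - j)) for j in range(n)] for i in range(n)]

def schur_bound(kernel):
    """Schur's test with constant weights: sqrt(max row sum * max column sum)."""
    max_row = max(sum(row) for row in kernel)
    max_col = max(sum(row[j] for row in kernel) for j in range(len(kernel[0])))
    return (max_row * max_col) ** 0.5

def bilinear_form(kernel, x, y):
    """Compute sum over i, j of x_i * K_ij * y_j."""
    return sum(x[i] * kernel[i][j] * y[j]
               for i in range(len(x)) for j in range(len(y)))

def l2_norm(v):
    """Euclidean norm of a list of floats."""
    return sum(t * t for t in v) ** 0.5

def check_schur(n=30, delta=0.5, trials=200, seed=0):
    """Return True if |<x, K y>| <= bound * |x| * |y| for every sampled pair x, y."""
    rng = random.Random(seed)
    kernel = toy_kernel(n, delta)
    bound = schur_bound(kernel)
    for _ in range(trials):
        x = [rng.uniform(-1, 1) for _ in range(n)]
        y = [rng.uniform(-1, 1) for _ in range(n)]
        lhs = abs(bilinear_form(kernel, x, y))
        if lhs > bound * l2_norm(x) * l2_norm(y) + 1e-9:
            return False
    return True

print(check_schur())
```

In the argument above the test is applied with non-constant weights (the gain and loss factors), which is what allows the two one-variable sums to be estimated separately; the sketch only illustrates the unweighted special case.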

9.4. AfThe Mid-Range Term. We control the term associated with (5.5), namely we prove 
(5.7). For integers s > r, set 

= E E i</> t^), J 

/ J : 2 s |J| = |/|:dist(/,J)<|/| , /nJ=0 

2n 1/2 



< " ' U 2 (o-) 



E E k#(^U"Um^ 

I \J : 2 s \J\ = \I\:dist{I,J)<\I\ , /nj=0 



<A(5) ||/|| i2(CT) ||0|| x2(w) , 



I s IIJ.II 2 

\L 2 (uj) ■ 



A( S ) 2 = 2*sup \Wh^lh u jU 2 

1 J : 2 3 |J| = |7"|:dist(/,J)<|/|, /[~|J=0 

since, by (3.4), 

E E |(0,^)J 2 = 2^^|(0,^)J 2 = 2^ 

/ J : 2 s |J| = |/|:dist(/,J)<|/| , 7nJ=0 I J 

Due to the 'local' nature of the sum in J, we have thus gained a small improvement in the 
Schur test to derive the last line. 

But $J$ is good, so that (6.6) applies to each child $I_\pm$ of $I$ as in (9.2) above. Hence, we have
using (2.18) that

A( S ) 2 <su P 2*£ ^-E^.f-PUV) 2 

1 e J :2 s \J\ = \I\:dist(I,J)<\I\, lnj=9 

~ T r £ E gg(g)~p(/.,i,..)' 

J : 2 s |J| = |/|:dist(7,J)<|/|, /nj=0 Vl 17 




<su P 2*2-e-*>y^ £ W (J) 

7 6» 1 1 J : 2 s |J| = |/|:dist(/,J)<|/| , /nJ=0 

< 2- (1 - 2£)s A . 

This is clearly a summable estimate in s > r, so the proof of (5.7) is complete. 

9.5. Af: The Neighbor Terms. The neighbor terms are defined in (5.6), (6.1), (6.16), and 
we are to prove (6.19). To recall, $I \in \mathcal{D}^\sigma$, $J \in \mathcal{D}^\omega$ is contained in $I$, with $|J| < 2^{-r}|I|$, and
$I_J$ is the child of $I$ that contains $J$.

Fix $\theta \in \{-,+\}$, and an integer $s > r$. Below we will use the convention that $I \setminus I_\theta = I_{-\theta}$.
The inner product to be estimated is that in (6.16):

(H (li_ g crAjf) , Aj(f)) u = ((^^/^(^A^)), 

= EJ_ 9 A^-((l J _ 9 (7, J ff(a;A^)} (J 

Use $\|\Delta^\omega_J \phi\|_{L^2(\omega)} = |\langle \phi, h^\omega_J\rangle_\omega|$ and $|J|/|I| = 2^{-s}$ in the Energy Lemma with $J \subset I_\theta \subset I$ to
obtain

| (# (lj_ e a) , A^> w | < | (0, /#)J w ( J)' • E (J, w ) • P (J, l^rr) 
< | (0, h«j) J u, ( J)'- (i-g, lj_ e a) 

Here, we are using $E(J,\omega) \le 1$ and (2.18), which applies since $J \subset I \setminus I_{-\theta}$.

In the sum below, we keep the length of the intervals $J$ fixed, and assume that $J \subset I_\theta$. We
estimate

Ai(I,e,s)= Kff(l/_^Af/),A^) w | 

J : 2°\J\ = \I\:JCl e 

< 2-^m_ e A^ I f\P(I 9 ,l I _ e a) £ | (0, J) 1/2 

J : 2 S |J| = |7|:JC/ S 

<2-( 1 - £ ) s |EL e AJ/|P(/ e ,l / _ e a)u;(/ e ) 1 / 2 A(/,^, S ), 

J : 2 s \J\ = \I\:JCl e 

The last line follows upon using the Cauchy-Schwarz inequality.
Using (3.2), we have 

$$|\mathbb{E}^\sigma_{I_{-\theta}} \Delta^\sigma_I f| \le |\langle f, h^\sigma_I\rangle_\sigma| \cdot \sigma(I_{-\theta})^{-1/2}.$$
And so, we can estimate Af (J, 9, s) as follows, in which we use the A2 hypothesis (1.6): 
Ai(I, 9, s) < 2-^1 (f, h°) a \A(I, 9, s) ■ a(/_ e )- 1/2 P(/ e , l^a^h) 1 ' 2 
< A 2 2-^\(f,h°) a \A(I,9,s), 
since P(Ie, li_ e o~) < shows that

/ t \-i/2 D /r i rr\, jt W 2 < a ( I -8) l,2uj {.h) 112 < . 

<t{1-o) P{lg, li_ g a)u{l e ) < r— : < A 2 , 

\ie\ 

A straightforward application of Cauchy-Schwarz then shows that

4(1, e, s) < A 2 2-^ s \\f\\ LH(7) || A(J, 6, s) || L2M . 

This estimate is summable in $\theta \in \{-,+\}$ and $s > r$, so the proof of (6.19) is complete.

10. A Counterexample to the Pivotal Conditions 

We exhibit a weight pair $(\omega,\sigma)$ that illustrates the nature of the Energy Condition, and
the subtlety of the two weight problem in general. In particular it shows that the Pivotal 
Conditions are not necessary for the two weight inequality (1.2). 

Theorem 10.1. There is a weight pair $(\omega,\sigma)$ which satisfies the two weight inequality (1.2),
and fails the dual Pivotal Condition, namely (1.7) with the roles of $\omega$ and $\sigma$ reversed.

Thus, this pair of weights satisfies the two weight inequality, but would not be included
in the analysis of [NTV4]. We prove this result by appealing to our Theorem 1.17. In the
course of the construction, we will see that one can make seemingly small modifications of
the example measure $\sigma$, and in so doing violate the $L^2$ inequality.

The plan of the proof of the Theorem is to (1) construct the pair of weights, and then 
to verify (2) the assertions on the Hybrid Conditions, (3) the $A_2$ condition and (4) the two
testing conditions (1.3) and (1.4). We take up these steps in the subsections below. 

10.1. Construction of the Pair of Weights. Recall the middle-third Cantor set $E$ and
Cantor measure $\omega$ on the closed unit interval $I^0_1 = [0,1]$. At the $k$-th generation in the
construction, there is a collection $\{I^k_j\}_{j=1}^{2^k}$ of $2^k$ pairwise disjoint closed intervals of length
$|I^k_j| = 3^{-k}$. With $K_k = \bigcup_{j=1}^{2^k} I^k_j$, the Cantor set is defined by $E = \bigcap_{k=1}^{\infty} K_k = \bigcap_{k=1}^{\infty} \big(\bigcup_{j=1}^{2^k} I^k_j\big)$.
The Cantor measure $\omega$ is the unique probability measure supported in $E$ with the property
that it is equidistributed among the intervals $\{I^k_j\}_{j=1}^{2^k}$ at each scale $k$, i.e.

$$\omega(I^k_j) = 2^{-k}, \qquad k \ge 0,\ 1 \le j \le 2^k.$$
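
The generations of the construction are easy to enumerate exactly. The following minimal sketch (function names are illustrative, not from the text) builds the $2^k$ closed intervals of generation $k$ with exact rational endpoints and checks the length $3^{-k}$ and the total mass $2^k \cdot 2^{-k} = 1$.

```python
from fractions import Fraction

def cantor_intervals(k):
    """Return the 2**k closed generation-k intervals as (left, right) pairs, left to right."""
    intervals = [(Fraction(0), Fraction(1))]
    for _ in range(k):
        refined = []
        for a, b in intervals:
            third = (b - a) / 3
            refined.append((a, a + third))   # left third survives
            refined.append((b - third, b))   # right third survives
        intervals = refined
    return intervals

def cantor_mass(k):
    """Mass that the Cantor measure assigns to each generation-k interval."""
    return Fraction(1, 2 ** k)

for k in range(6):
    generation = cantor_intervals(k)
    assert len(generation) == 2 ** k
    assert all(b - a == Fraction(1, 3 ** k) for a, b in generation)
    assert len(generation) * cantor_mass(k) == 1

print(cantor_intervals(2))
```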

We will define three measures $\sigma$, $\dot\sigma$, $\ddot\sigma$. We denote the removed open middle third of $I^k_j$ by
$G^k_j$. The three measures, restricted to an interval $G^k_j$, will be a point mass with a weight that
is only a function of $k$. The only distinction will be the location of the point mass.
Let $\dot z^k_j \in G^k_j$ be the center of the interval $G^k_j$, which is also the center of the interval $I^k_j$.
Now we define

(10.2) $\dot\sigma = \sum_{k,j} s^k_j\,\delta_{\dot z^k_j}$,

where the sequence of positive numbers $s^k_j$ is chosen to satisfy the following precursor of the
$A_2$ condition:

The self-similarity of this measure makes it useful in verifying the counterexample. But, it
appears that the pair of weights $(\omega,\dot\sigma)$ does not satisfy the two weight inequality (1.1).

The construction of the other two example measures is closely related to the structure of
the function $H\omega$. On each interval $G^k_j$, $H\omega$ is monotonically decreasing, from $\infty$ at the left
hand endpoint of $G^k_j$, to $-\infty$ at the right hand endpoint. In particular, $H\omega$ has a unique
zero $z^k_j$ in $G^k_j$. And this selection of points defines $\sigma$ as in (10.2), namely

$$\sigma = \sum_{k,j} s^k_j\,\delta_{z^k_j}.$$

Of course, we gain a substantial cancellation in the testing condition (1.4) by locating the
point mass at the zero of $H\omega$.

We then define the third measure $\ddot\sigma$ by taking $\ddot z^k_j \in G^k_j$ to be the unique point so that $H\omega(\ddot z^k_j) =
(3/2)^k$. We then can easily check that the $L^2$ inequality for $(\omega,\ddot\sigma)$ does not hold:

$$\int |H\omega|^2\,d\ddot\sigma = \sum_{k=1}^{\infty} \sum_{j=1}^{2^k} s^k_j \left(\frac{3}{2}\right)^{2k} = \infty.$$

The weight pair $(\omega,\ddot\sigma)$ can be seen to satisfy the $A_2$ condition and the forward testing condition
(1.3), but to fail the backward testing condition. Thus, this pair of weights provides an alternate
example to those provided in [NaVo] and [NiTr]. We will not further discuss the measure $\ddot\sigma$.
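
The divergence can be made concrete with exact arithmetic. The sketch below is an illustration under two loudly stated assumptions that are not confirmed by the surviving text: the weights are taken to be $s^k_j = (2/9)^k$, which is what the normalization $s^k_j\,\omega(I^k_j)/|I^k_j|^2 = 1$ would give, and generation $k$ is taken to contain $2^k$ point masses. With these choices every generation contributes exactly $1$ to the sum, so the partial sums grow linearly.

```python
from fractions import Fraction

def weight(k):
    """ASSUMPTION: s^k = (2/9)**k, from s^k * omega(I^k_j) / |I^k_j|**2 = 1."""
    return Fraction(2, 9) ** k

def normalization(k):
    """Evaluate s^k * omega(I^k_j) / |I^k_j|**2 with omega(I^k_j) = 2**-k, |I^k_j| = 3**-k."""
    return weight(k) * Fraction(1, 2 ** k) / Fraction(1, 3 ** k) ** 2

def partial_sum(generations):
    """Sum over k = 1..generations of (number of point masses) * s^k * ((3/2)**k)**2."""
    total = Fraction(0)
    for k in range(1, generations + 1):
        point_masses = 2 ** k          # ASSUMPTION: 2**k point masses in generation k
        total += point_masses * weight(k) * Fraction(3, 2) ** (2 * k)
    return total

print([partial_sum(n) for n in (1, 5, 10)])
```

Since $2^k (2/9)^k (9/4)^k = 1$ for every $k$, the partial sum over the first $n$ generations equals $n$ under these assumptions.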

We can calculate the rate at which $H\omega$ blows up at the endpoints of the complementary
intervals. The rate is a reflection of the fractal dimension of the Cantor set.

Lemma 10.3. Write $G^k_j = (a^k_j, b^k_j)$. We have

$$H\omega(a^k_j + c3^{-k}) \approx (3/2)^k, \qquad k \ge 1,\ 1 \le j \le 2^k,$$

and a similar estimate holds for $b^k_j$. (Implied constants can be taken absolute; signs will be
reversed for $b^k_j$.)
This in particular shows that the zeros $z^k_j$ cannot move too far from the middle:

(10.4) sup ' 5 jl <C<1- 

The points $\ddot z^k_j$ satisfy a similar inequality. This indicates the sensitivity of the two weight
inequality to the precise definition of the measures involved.
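
The location of a zero of $H\omega$ inside a gap can be explored numerically. The sketch below is only an illustration: it replaces $\omega$ by equal point masses at the midpoints of the generation-$m$ intervals (an approximation chosen here, not taken from the text), assumes the kernel convention $H\omega(x) = \int \frac{\omega(dy)}{x-y}$, and finds the sign change by bisection. All function names are hypothetical.

```python
def cantor_midpoints(m):
    """Midpoints of the 2**m generation-m Cantor intervals; each carries mass 2**-m."""
    lefts = [0.0]
    length = 1.0
    for _ in range(m):
        length /= 3.0
        lefts = [a for left in lefts for a in (left, left + 2.0 * length)]
    return [left + 0.5 * length for left in lefts]

def hilbert_omega(x, atoms):
    """Discretized H(omega)(x) = sum of mass / (x - y), assuming kernel 1/(x - y)."""
    mass = 1.0 / len(atoms)
    return sum(mass / (x - y) for y in atoms)

def gap_zero(a, b, atoms, iterations=60):
    """Bisection for the zero of the decreasing function H(omega) on the gap (a, b)."""
    lo, hi = a, b
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if hilbert_omega(mid, atoms) > 0:
            lo = mid      # still on the positive (left) side of the zero
        else:
            hi = mid
    return 0.5 * (lo + hi)

atoms = cantor_midpoints(12)
central_zero = gap_zero(1.0 / 3.0, 2.0 / 3.0, atoms)
left_zero = gap_zero(1.0 / 9.0, 2.0 / 9.0, atoms)
# Offset of the zero from the gap center, as a fraction of the gap length.
left_offset = (1.0 / 6.0 - left_zero) / (1.0 / 9.0)
print(central_zero, left_zero, left_offset)
```

By symmetry the zero in the central gap is the midpoint $1/2$. In the gap $(1/9, 2/9)$ the zero is pulled toward the left endpoint by the mass on $[2/3, 1]$, but it stays well inside the gap, which is the qualitative content of (10.4).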

Proof. Fix $k$, and consider the numbers $H\omega(a^k_j + c3^{-k})$ for $1 \le j \le 2^k$. These are monotonically
increasing as the point of evaluation moves from left to right across the interval $[0,1]$.
So we should verify that

(10.5) $C_1 (3/2)^k \le H\omega(a^k_1 + c3^{-k}) \le H\omega(a^k_{2^k} + c3^{-k}) \le C_2 (3/2)^k$.

We consider the right hand inequality. Writing



k i „o-An 



Hoj(al k + c3 



u(dy) 



< 



(Gj^c a* fc + c3 k -y 



a k k + c3 k - y 

Here, we have discarded that part of the domain of the integral where the integrand would
be negative. Now, on the interval $[0, a^k_{2^k}]$, the support of $\omega$ is contained in the set $\bigcup_{\ell=1}^{k} I^\ell_{2^\ell - 1}$.
Using this, we continue the estimate above as

k 

Hu(a k k +c3- k )<J2uj(I £ 2e _ 1 ) sup * 

fe-i 



It is useful to note for use below, that in this sum, the summand associated with $\ell = k$ is the
dominant one.

We consider the left hand inequality in (10.5). We split the support of to into the sets 
I k , I* l\ ~\ ■ ■ ■ , By the argument above, we have 

fc-i 

Y,H(ul It2 )(a k + c3- 



< A(3/2) ? 



1=1 

where A is absolute, and we have yet to select c. Then, we have 



f 1 1 

#M/fu/|) = / — ^z~ k k — r-i , ^Q-fc — u ( d y} 

1 2 Jjk Oj — c3 k — y a K - — (1 + c)3 k — y 
For $0 < c < (2A)^{-1}$, we conclude our Lemma. □
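
The growth rate $(3/2)^k$ in the Lemma can be observed numerically. The following sketch is an illustration under stated assumptions, not the proof: $\omega$ is approximated by equal point masses at the midpoints of the generation-$m$ intervals, the kernel is taken to be $1/(x-y)$, and the point of evaluation is $a + c3^{-k}$ with $a = 3^{-k}$ the left endpoint of the gap $(3^{-k}, 2 \cdot 3^{-k})$ and the illustrative choice $c = 0.1$. Function names are hypothetical.

```python
def cantor_midpoints(m):
    """Midpoints of the 2**m generation-m Cantor intervals; each carries mass 2**-m."""
    lefts = [0.0]
    length = 1.0
    for _ in range(m):
        length /= 3.0
        lefts = [a for left in lefts for a in (left, left + 2.0 * length)]
    return [left + 0.5 * length for left in lefts]

def hilbert_omega(x, atoms):
    """Discretized H(omega)(x) = sum of mass / (x - y), assuming kernel 1/(x - y)."""
    mass = 1.0 / len(atoms)
    return sum(mass / (x - y) for y in atoms)

def blow_up_ratios(max_k=5, c=0.1, m=12):
    """Ratios H(omega)(a + c*3**-k) / (3/2)**k at the gap (3**-k, 2*3**-k), k = 1..max_k."""
    atoms = cantor_midpoints(m)
    ratios = []
    for k in range(1, max_k + 1):
        a = 3.0 ** (-k)                 # left endpoint of the gap
        x = a + c * 3.0 ** (-k)         # evaluation point just inside the gap
        ratios.append(hilbert_omega(x, atoms) / 1.5 ** k)
    return ratios

print(blow_up_ratios())
```

The ratios stay between fixed positive constants as $k$ grows, consistent with $H\omega(a + c3^{-k}) \approx (3/2)^k$ for small $c$.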

10.2. The $A_2$ Condition. We verify that the usual $A_2$ condition holds for the pair $(\omega,\sigma)$.
Due to the property (10.4), this same argument will apply to the measures $\dot\sigma$ and $\ddot\sigma$. The
starting point is the estimate

l\ k I ' 2\ k 00 /2\ 2fc /2 x2 ^ 

3 



From this, it follows that we have 

a(I k )u{lf) s k u{I k ) 



;io.6) 



\Tk\2 \T k \ 2 
I 3 I I 3 1 



The analogous condition for the Poisson or strengthened $A_2$ condition also holds. Indeed,
using the uniformity of $\omega$, one can verify

P (r* , A < u ^ 

Pf/ £ a) < V— ^ k<V— - ^ <(-] <^ 

m=0 ^ ' m=0 ^3 ) ^ ' ' r ' 

From this and (10.6), we see that 

p(i r V)p(/ r V)<i. 
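
The boundedness of the product of Poisson integrals can be checked numerically on the Cantor intervals. The sketch below rests on assumptions that the surviving text does not confirm: the Poisson integral is taken as $P(I,\mu) = \int \frac{|I|}{(|I| + |x - x_I|)^2}\,\mu(dx)$, the Cantor measure is replaced by point masses at generation-$m$ midpoints, and the second measure is the self-similar one with mass $(2/9)^k$ at the center of each generation-$k$ interval for $k < m$. All names are hypothetical.

```python
def cantor_lefts(k):
    """Left endpoints of the 2**k generation-k Cantor intervals (each of length 3**-k)."""
    lefts = [0.0]
    length = 1.0
    for _ in range(k):
        length /= 3.0
        lefts = [a for left in lefts for a in (left, left + 2.0 * length)]
    return lefts

def omega_atoms(m):
    """Discretized Cantor measure: mass 2**-m at the midpoint of each generation-m interval."""
    half = 0.5 * 3.0 ** (-m)
    return [(left + half, 2.0 ** (-m)) for left in cantor_lefts(m)]

def sigma_dot_atoms(m):
    """ASSUMPTION: mass (2/9)**k at the center of each generation-k interval, for k < m."""
    atoms = []
    for k in range(m):
        half = 0.5 * 3.0 ** (-k)
        atoms.extend((left + half, (2.0 / 9.0) ** k) for left in cantor_lefts(k))
    return atoms

def poisson(left, length, atoms):
    """P(I, mu) = sum over atoms of mass * |I| / (|I| + |x - x_I|)**2 (assumed definition)."""
    center = left + 0.5 * length
    return sum(mass * length / (length + abs(x - center)) ** 2 for x, mass in atoms)

def a2_products(max_k=5, m=11):
    """Products P(I, omega) * P(I, sigma_dot) for I = [0, 3**-k], k = 1..max_k."""
    omega, sigma = omega_atoms(m), sigma_dot_atoms(m)
    return [poisson(0.0, 3.0 ** (-k), omega) * poisson(0.0, 3.0 ** (-k), sigma)
            for k in range(1, max_k + 1)]

print(a2_products())
```

Under these assumptions $P(I,\omega)$ grows like $(3/2)^k$ while the Poisson integral of the second measure decays like $(2/3)^k$, so the products remain of order one.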

Let us consider an interval 7 C [0, 1], and let k be the smallest integer such that z k G AI. 
Here A > 1 is a large constant, dependent upon the constant in (10.4). We note that j is 
unique. For j < j', it follows that for some j" we have z k , < z^T 1 < z k . In particular, we will 
have cr(AI) ~ v(G k ). Let us also assume that G k C AI. Let I k ~ x D It follows that we 
have 

P (I, U ) P (I, a) < P (I k ~\ u) P [l k r -\ a) < ^1^1 < 1 . 
The last case is G k 2 A/. We then have 
The last inequality follows from the definition of Zj, and fact that we must have dist(7, dG k ) > 
\I\, provided A is sufficiently large. We then have P (/, u) < 2~ k -^L _ And so we can estimate 

s k I T\ 



3 I 

2- k s k 
< L < l 

10.3. The Pivotal and Hybrid Conditions. In this section, we show that the weight pair
$(\omega,\sigma)$ fails the dual Pivotal Condition, namely the Hybrid Condition with $\varepsilon = 0$ and the
roles of $\omega$ and $\sigma$ reversed. But it satisfies the Hybrid Condition for all $0 \le \varepsilon \le 2$, and the
dual Hybrid Condition for $\varepsilon_0 < \varepsilon \le 2$ for some $\varepsilon_0 < 2$.

10.3.1. Failure of the Pivotal Condition for $\varepsilon = 0$. Failure of the Pivotal Condition is straightforward.
Indeed, If C C ... C 1° and so

P ( G «, u ) » p (n , «) « e MM® » e |^2- « (?) ' , 

fc=0 Mi I fc=o ^ ' 

and similarly 

P(G& W )«P(# W )«QY, allr. 
Considering the decomposition $\bigcup_{\ell,r} G^\ell_r \subset [0,1]$ we thus have

EI^Lp(«^r,E<i)'g)'ffl^E-oc, 

which shows that the dual Pivotal Condition, the one dual to (1.7), fails. 

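10.3.2. The dual Hybrid Condition for large $\varepsilon$. Next we show that the dual Hybrid Condition

$$\sum_{r=1}^{\infty} \sigma(I_r)\,E(I_r,\sigma)^{\varepsilon}\,P(I_r, 1_{I_0}\omega)^2 \le (\mathcal{E}^*_\varepsilon)^2\,\omega(I_0),$$

holds for all $\varepsilon_0 < \varepsilon \le 2$ where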

In 2 2 

We need this estimate, which shows that with Energy, we can get an essential strengthening 
of the A 2 condition. 
Proposition 10.7. For $\varepsilon > \varepsilon_0$ and any interval $I \subset [0,1]$, we have the inequality
(10.8) $\sigma(I)\,E(I;\sigma)^{\varepsilon}\,P(I;\omega)^2 \lesssim \omega(I)$.

Proof. We can assume that $E(I;\sigma) \ne 0$. Let $k$ be the smallest integer for which there is an
$r$ with $z^k_r \in I$. And let $n$ be the smallest integer so that for some $s$ we have $z^{k+n}_s \in I$ and
$z^{k+n}_s \ne z^k_r$. We can estimate $E(I;\sigma)$ in terms of $n$. Namely, we have



< 



2 x n 



Indeed, the worst case is when $s$ is not unique. Then, there are two choices of $s$, but not
more. Let $z^{k+n}_{s'} \in I$, where $s' \ne s$. Then, note that we have

|/-{**}L f^ v 



a(I) \9 / 

and this leads to the estimate above, remembering the characterization of Energy as a variance 
term. 

Next we note that $\sigma(I) \approx (2/9)^k$, $\omega(I) \gtrsim 2^{-k-n}$, and $P(I;\omega) \approx (3/2)^k$. This specifies everything
in (10.8), so we choose $\varepsilon$ so that



2 \ ft / 2 \ 3 ^ 



<2 



-k—n 



9J \9J \2 t 

This inequality will be true for all pairs of n, if eo < e < 2 where 

1 



□ 

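It is now clear that the pair of weights $(\omega,\sigma)$ satisfies the dual Energy conditions $\mathcal{E}^*_\varepsilon$ for
$\varepsilon_0 < \varepsilon \le 2$. Let $I_0 \subset [0,1]$ and let $\{I_r : r \ge 1\}$ be any partition of $I_0$. We appeal to (10.8)
to see that

$$\sum_{r \ge 1} \sigma(I_r)\,E(I_r;\sigma)^{\varepsilon}\,P(I_r;\omega)^2 \lesssim \sum_{r \ge 1} \omega(I_r) = \omega(I_0).$$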

10.3.3. The Hybrid Condition for positive $\varepsilon$. It remains to verify that the pair of measures
$(\omega,\sigma)$ satisfies the Hybrid Conditions for all $0 \le \varepsilon \le 2$. We will establish the pivotal condition
(1.7), i.e. $\mathcal{E}_0 < \infty$, which then implies that $\mathcal{E}_\varepsilon < \infty$ for all $0 \le \varepsilon \le 2$. For this it suffices to
show that the forward maximal inequality

(10.9) $\int M(f\sigma)^2\,d\omega \le C \int |f|^2\,d\sigma$
holds for the pair $(\omega,\sigma)$, and (10.9) in turn follows from the testing condition

(10.10) $\int_Q M(1_Q\sigma)^2\,d\omega \le C \int_Q d\sigma$,

for all intervals $Q$ (see [Saw1]). We will show (10.10) when $Q = I^\ell_r$, the remaining cases being
an easy consequence of this one. For this we use the fact that

(10.11) $M(1_{I^\ell_r}\sigma)(x) \le C\,(2/3)^{\ell}, \qquad x \in E \cap I^\ell_r$.

To see (10.11), note that for each $x \in I^\ell_r$ that also lies in the Cantor set $E$, we have
M(l 4 a)(x)< sup w da* sup ^ ^ «(-). 

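Now we consider for each fixed $m$, the approximations $\omega^{(m)}$ and $\sigma^{(m)}$ to the measures $\omega$ and
$\sigma$ given by

(10.12) $d\omega^{(m)}(x) = \sum_{i=1}^{2^m} 2^{-m}\,\frac{1}{|I^m_i|}\,1_{I^m_i}(x)\,dx, \qquad \sigma^{(m)} = \sum_{k<m} \sum_{j=1}^{2^k} s^k_j\,\delta_{z^k_j}.$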

For these approximations we have in the same way the estimate 

£ 2" 



M (i/<o- (m) ) (x) < C (?) , xejjir. 



Thus for each m > 1 we have 

f M(l^) 2 du^ < C J2 (ff 2 



t:JJ»CJ£ 



= C2 m ^ (?) 2~ m = Csf *C J^da. 

Taking the limit as $m \to \infty$ yields the case $Q = I^\ell_r$ of (10.10). This completes our proof of
the pivotal condition, and hence also the Hybrid Conditions (1.10) for all $0 \le \varepsilon \le 2$.

10.4. The Testing Conditions. As an initial step in verifying the forward testing condition
(1.3) for the pair $(\omega,\sigma)$, we replace $\sigma$ by the self-similar measure $\dot\sigma$, and exploit the
self-similarity of both measures $\omega$ and $\dot\sigma$:

(10.13) $\omega = \frac{1}{2}\,\mathrm{Dil}_{1/3}\,\omega + \frac{1}{2}\,\mathrm{Trans}_{2/3}\,\mathrm{Dil}_{1/3}\,\omega = \omega_1 + \omega_2,$
$\dot\sigma = \frac{2}{9}\,\mathrm{Dil}_{1/3}\,\dot\sigma + \delta_{1/2} + \frac{2}{9}\,\mathrm{Trans}_{2/3}\,\mathrm{Dil}_{1/3}\,\dot\sigma = \dot\sigma_1 + \delta_{1/2} + \dot\sigma_2.$



We compute 



J\Ha\ 2 tu = J H^& 1 + 5i+& 2 S j 2 ^i + J H(& 1 + Si+& 2 ^j 



cu 2 



+ 



y |/Jcr 2 | 2 W 2 | 



l-Hail uj 1 + \H& 2 \ to 2 > +TZ, 



where the remainder term 1Z £ is easily seen to satisfy 



V < A 2 

'^e r^e 



since the supports of $\delta_{1/2} + \dot\sigma_2$ and $\omega_1$ are well separated, as are those of $\delta_{1/2} + \dot\sigma_1$ and $\omega_2$. For
this we first use $(a+b)^2 \le (1+\varepsilon)\,a^2 + \left(1 + \frac{1}{\varepsilon}\right) b^2$ to obtain



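$$\int |H(\dot\sigma_1 + \delta_{1/2} + \dot\sigma_2)|^2\,\omega_1 \le \int \big(|H(\dot\sigma_1)| + |H(\delta_{1/2} + \dot\sigma_2)|\big)^2\,\omega_1 \le \int \Big\{(1+\varepsilon)\,|H(\dot\sigma_1)|^2 + \Big(1 + \frac{1}{\varepsilon}\Big)\,|H(\delta_{1/2} + \dot\sigma_2)|^2\Big\}\,\omega_1,$$

and then for example,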



|ff(<7 2 )| Wi 



< 



[1 ,1] y-x 
%([f,i])MMD 



< 9 



1 1 [0,1] 

<t([0,1]M[0,1] 



I [0,1] 



*2 < Al I &. 



But now we note that 



|ff o"i| 2 u)\ 



\Ha 1 (x) 1 2 Dili u(x) 



H& 1 



x 




U> (X) 



-(- 

2 V 9 



\H& (x)\ 2 u{x) = ■ 
and similarly $\int |H\dot\sigma_2|^2\,\omega_2 = \frac{2}{9} \int |H\dot\sigma|^2\,\omega$. Thus we have



3 

-(- 

2 V 9 



\H&\ 2 u, 



u> (x) 




Z X 

3 3 



a(z) 



oj (x) 
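(10.15) $\int |H\dot\sigma|^2\,\omega = \frac{2}{9}(1+\varepsilon) \int |H\dot\sigma|^2\,\omega + \frac{2}{9}(1+\varepsilon) \int |H\dot\sigma|^2\,\omega + \mathcal{R}_\varepsilon$

and provided $\int |H\dot\sigma|^2\,\omega$ is finite we conclude that

$$\int |H\dot\sigma|^2\,\omega = \frac{1}{1 - \frac{4}{9}(1+\varepsilon)}\,\mathcal{R}_\varepsilon \lesssim_\varepsilon \mathcal{A}_2^2 \Big(\int \dot\sigma\Big)$$

for $\varepsilon > 0$ so small that $1 - \frac{4}{9}(1+\varepsilon) > 0$.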
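
The two elementary facts behind this absorption step can be checked with exact arithmetic. The sketch below is illustrative only: the contraction factor $\theta = 4/9$ is an assumption inferred from the self-similarity coefficients, the remainder is normalized to $1$, and the function names are hypothetical. It verifies the inequality $(a+b)^2 \le (1+\varepsilon)a^2 + (1+\frac{1}{\varepsilon})b^2$ on a grid, and shows that iterating $X_m = \theta(1+\varepsilon)X_{m-1} + R$ from $X_0 = 0$ never exceeds $R / (1 - \theta(1+\varepsilon))$.

```python
from fractions import Fraction

def young_gap(a, b, eps):
    """Return (1+eps)*a^2 + (1+1/eps)*b^2 - (a+b)^2, which should be nonnegative."""
    return (1 + eps) * a * a + (1 + 1 / eps) * b * b - (a + b) ** 2

def iterate(theta, eps, remainder, steps):
    """Iterate X_m = theta*(1+eps)*X_{m-1} + remainder starting from X_0 = 0."""
    q = theta * (1 + eps)
    x = Fraction(0)
    history = []
    for _ in range(steps):
        x = q * x + remainder
        history.append(x)
    return history

theta = Fraction(4, 9)       # ASSUMPTION: fraction of the left side reappearing on the right
eps = Fraction(1, 10)        # small enough that theta * (1 + eps) < 1
remainder = Fraction(1)      # normalized remainder term
limit = remainder / (1 - theta * (1 + eps))

grid = [Fraction(n, 4) for n in range(-8, 9)]
assert all(young_gap(a, b, eps) >= 0 for a in grid for b in grid)

history = iterate(theta, eps, remainder, 40)
assert all(x < limit for x in history)
print(limit, float(history[-1]))
```

The gap in the first inequality is the square $(\sqrt{\varepsilon}\,a - b/\sqrt{\varepsilon})^2$, which is why it holds for every $\varepsilon > 0$; the price is the factor $1 + 1/\varepsilon$ on the remainder.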

To avoid making the assumption that $\int |H\dot\sigma|^2\,\omega$ is finite, we use approximations as follows.
For each fixed $m \ge 1$, consider the approximations $\omega^{(m)}$ and $\dot\sigma^{(m)}$ to the measures $\omega$ and $\dot\sigma$
as in (10.12). We have the following self-similarity equations involving $\omega^{(m)}$ and $\dot\sigma^{(m)}$ that
substitute for (10.13): for $m \ge 2$,



(m) 



to 



9 3 

As above we compute that 
\Ha {m) \ 2 u {m) 



- Dih J™-^ + ~ Trans 2 Dili u {m ~ l) = u{ m) + 

2 3 2 3 3 1 /5 

Dili d (m ~ 1} +5x+- Trans 2 Dih d^ 1 ) = <rj m) + 5i + 



5 9 



• ( m ) 



where the remainder term 7?1 satisfies T^e" 1,1 < e .A| (J d), and also that 



(m) 



(m) 



#df } 



F(d{ m) + 5i +df } 



' M 

^2 



Fdi 



(m) 



(m) 



1 

2 
1 

2 

1 /2 



#dj m) 



Dih a; {m - 1} (x) = - 
a v ; 2 



#dj m) 



a;^- 1 ) fx) 




*-§9 i 



2 V9 
1 f2 




1 



£ _ 

3 3 



a; (m " 1} (x) 



u {m ~ 1] (x) 



9 / \Ha {m - 1) {x)\ 2 ^ m - l) {x) = - / Itfd^-^lV™- 1 ) 



and J 



Ho. 



» 



2\9J J 1 y 71 v y 9 

( m) = |J lifd^-^V™- 1 ). Thus we have 

y |#d (m) |V m) = i(l +e) y ji/d^-^fw^- 1 ) +^ m) , m>2. 
Iterating this equality yields



\Ha {m) \ 2 u {m) =1-0 +■:) 



Q(l+^)) m / l^r^+^Qa + e)^^), m>2, 



from which we obtain 



\Ha^\ 2 ^< e Ali Id 



m > 2, 



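with a constant $C_\varepsilon$ independent of $m$. Taking the limit as $m \to \infty$ proves $\int |H\dot\sigma|^2\,\omega \le
C_\varepsilon \mathcal{A}_2^2 \big(\int \dot\sigma\big) < \infty$.

This completes the proof of the forward testing condition (1.3) for the interval $I = [0,1]$.
The proof for the case $I = I^\ell_r$ is similar using $\mathcal{R}_\varepsilon(I^\ell_r) \le C_\varepsilon \mathcal{A}_2^2 \big(\int_{I^\ell_r} \dot\sigma\big)$, and the general case
now follows without much extra work.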

Remark 10.16. The self-similarity argument above works on the forward testing condition
because the central point mass $\delta_{1/2}$ is a significant fraction $\frac{5}{9}$ of the mass of $\dot\sigma$ and is well
separated from the measure $\omega$ at all scales. This accounts for the fact that a mere fraction $\frac{4}{9}$
of the left side of (10.15) appears on the right side. This argument fails to apply to the two
weight inequality (10.5) itself because the measure $f\dot\sigma$ need not have a significant proportion
of its mass concentrated at the point $\frac{1}{2}$.

Having verified the forward testing condition for the weight pair $(\omega,\dot\sigma)$, we show that the
forward testing condition (1.3) holds for $(\omega,\sigma)$. For this, we estimate the difference

2 



771 T t (a — &)\ oj 



it 



Li 



E 



X — Z: 



X Zi. 



co (x) 



1 1 



E 



4 



z) - z{ 



(x - 4) (x - z{) 



cu (x) 



< c 



1 1 



E 

(k,jy.zf?ei* 



b 3 



|7* 



\x — Zi 



uj [x] 



In the last line, we have used (10.4). Now for any fixed $x$ in the support of $\omega$ inside $I^\ell_r$, we
have



E 



x — z- 



E 



E 



I k 

I 3 I 



m -° (k,j):z^4 and I x-z k . I ^3" m 1 1* I 



3 \ u I 

F _ 3 I 






< 



E E 

m -° (k,j):z!?el*. and U-«f'|»i3- m |l*| 



3 J \3J V(3- m - £ ) 2 



1 \ / ^ 

m=0 k>£+m 
-v / * \ fc / q \ 



E E 2' 

m=0 fc>^+', 
oo 

E E 

m=0 

oo 

E 



3/ \3/ V(3- 



m=0 k>l+m 



_4 y /9 

27/ V2 



m=0 



Thus we get 



which yields 



2^ /„\ 2-6 



This is the case $I = I^\ell_r$ of the forward testing condition (1.3) for the weight pair $(\omega,\sigma)$, and
the general case follows easily from this.

Finally, we turn to the dual testing condition (1.4) for the weight pair $(\omega,\sigma)$. For an interval
$I^\ell_r$ and $z^k_j \in I^\ell_r$, we claim that

(10.17) $|H(1_{I^\ell_r}\omega)(z^k_j)| \lesssim P(I^\ell_r, \omega)$.

To see this let If -1 denote the parent of 1^ and denote the other child of If -1 . Then we 
have using Hui (z 3 k ) =0, 

= -H (z$ 

= ~H (4) - H (h U u) (zi) . 

Now we have using H (to) (zf) = that 
H (4) = H (l {l e- r u) (4) - {# (l {l e- r u) (4) - H (l {li - r u?j (zi)} 

= -H (l^u) (z e r ) - A, 

where 



A = H ( 1/jt-iyU 



(4) - H (l {li - r u) (zi) 
Combining equalities yields

H (1 4U ) (z{) = H (4) +A-H (4) . 

We then have for (k, j) such that G I*, 



\H{hi-^) (4)\ < 
\A\ 



< 



\ifr l \ ' 
i i 



(in 
H i 1 ^) (4) 



ry /yd, rp /yk 



LO (x) < / — tjUJ (x) 

J(it-iy\x-z{-v 2 v ; 



which proves (10.17). 

Now we compute using (10.17) and the estimate for $P(I^\ell_r, \omega)$ above that

(\H(i^)\ 2 da = Yl \ H MH)\ 2 s)<c Yl \ p (i»\ 2 * 



2 „* 
j 



This is the case $I = I^\ell_r$ of the dual testing condition (1.4) for the weight pair $(\omega,\sigma)$, and the
general case follows easily from this.